NASA CR- 


Systems Analysis Of The Space Shuttle 
Final Report 

April 17, 1974 - April 18, 1975 

National Aeronautics and Space Administration 
Johnson Space Center - Houston 
imder 

NASA Contract NAS 9 - 13940 


Department of Electrical Engineering 


(NASA-CR-1 41887) SYSTEMS ANALYSIS OF THE 
SPACE SHUTTLE Final Report, 17 Apr. 1974 - 
18 Apr. 1975 (City Coll, of the City Univ. 
of New York.) 196 p HC $7.00 CSCL 22B 

63/ Id 


N75-25997 


Unclas 

27302 


Donald L. Schilling Se Jexing Oh Fred Thau 
Professors of Electrical Engineering; Co-Principal Investigators 




% 









^ 


THE CITY COLLEGE of 
THE CITY UNIVERSITY of NEW YORK 


Systems Analysis Of The Space Shuttle 
Final Report 


A]>ril 1C, 197d - April 17, 1075 

National Aeronautics and Space Administration 
Johnson Space Center - Houston 
under 

NASA Contract NAS 9 - 13940 
Department of Electrical Engineering 


Donald L. Schilling Se Jeung Oh Fred Thau 

Professors of Electrical Engineering; Co- Principal Investigators 



Table of Contonts 


I 


Introduction 

I. Communicatious System 

lo Digital Processing of Video Signals 

2. Entropy Encoding of a Delta Modulator Output for Bandwidth Compression 

3. Proccsssing and Caiversion of Delta Modulation Encoded Signals 

4. A Phase Locked Loop with NonLinear Processor 

5. The Measurement of light Intensity of Computer Generated Pictures 

n. Computer System 

1. Testability Enhancement in Digital System Design 

2. Parallelism Exploitation in Parallel Computing Systems 

3. An Ejq) elemental Simultaneous Multiprocessor Organization 

nio Power Distribution System * 

1. Simulation Program Description for Space-Shuttle Electric 
Power Distribution System 

2, Automatic Fault Detection 


ii 


Introduction 


Tills report consists of three aspects of the Shuttle Systems Analysis problem: 
Communications, Computers aud Power Distribution. Part I summarizes the 
Comimmieations re, search vflix'ected by Professor D. L. Schilling; Part II • 
simimarizes the Computer research directed by Professor S. J. Oh and 
P:irt ni summarizes the I'esearch on Power Distribution directed by Professor f 
F. Thau. 

In addition to tlie three eoprincipal investigators tliis contract partially 
suppox’ted Dr. C. B, Park and tlie following Doctoral Students : T.Appslewicz, 

C. S. Chuang, E. Feria, R, Lei, J. LoCicero, G. S. Mersten, V, Rao, 

N. Scheinberg, D. Ueci, L„ Weiss and C. Zeigler. 

Several papers, summarizing partieular aspects of the research . 

conducted under this contract, were presented at Gonferenees, and the 
paper, "Video Encoding Using an Adaptive Digital Delta Modulator" by 
L. Weiss, 1. Paz, andD. L. Schilling has been accepted for publication as a 
TcchniGal paper in the IEEE Transactions on Communications. 
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L 1» Digital Pyoceasing of Video Signals 

Ao High Speed Frame to Frame Delta Modulation 

Successive frames of a motion picture contain redundant informationo The 
amount of redundancy depends upon the degree of change in the scene from frame 
to frameo We are currently building a high speed delta modulator which encodes 
television signals in such a way as to use the frame to frame redundancy of 
motion pictures to reduce the bit rate required to transmit a television signal. 

Figure 1 shows the basic encoding scheme. Each large square represents 
a successive frame of a motion picture. Each dot on a frame represents a 
pixel (pictmze element). Note that each pixel has associated with it a Delta 
Modulator (A MOD) that follows the same pixel through successive frames. 

Thus the A MOD in the upper most left hand corner always encodes the pixel 
in the upper .most left hand corner for every frame. 

Bit rate compression is achieved in the following manner. From pre\'ious 
studies of delta modulators it has been observed that high sampling rates are 
required for a delta modulator to accurately encode rapidly changing signals 
and low sampling rates may be used on slowly varying signals. The usual 
method of encoding a picture by Sampling successive pixels within the same 
frame results in rapidly changing signals which require a high delta modula- 
tor bit rate (typically# 6 bits per pixel) . With the feame to frame eueoding 
technique of Fig. 1 each delta modulator is associated with its own pixel, hi 
general the pixel value will change slov/ly from frame to frame, allowing the 
delta modulator to accurately encode the slowly changing pixel value at a low 
bit rate. Using this scheme motion pietoes will be encoded at 1 bit per pixel. 

At first thought, hardware implementation of Fig. 1 may seem impossible 
since, at 1 delta modulator per pixel, a typical 20Q 000 pixels/frame 3 MHz 
bandwidth TV system would have to contain 200,000 delta modulators each 
operating at a 6MHz sampling rate. Fortunately only 1 6MHz delta modulator 
need be used to implement Fig. 1;' however, this single delta modulator will 
have to contain enougli shifi: register type memory to store an entire picture 
frame. 




The reasoa that 1 delta modulator can replace all the delta modulators of 
Fig. . 1 with no increase in operating speed is simple. Si Fig. 1 were Imple- 
mented as shown all 200, 000 pixels could be encoded, decoded and displayed in 
parrallelj but a real time television system requires only due pixel at a time in 
serial. The retention time of the eye and screen give the appearance of a full 
picture. We may taJ?e advantage of this by starting a single delta modulator at . 
the upper most left hand eorner of a frame, let it encode that pixel based upon 
its previous estimate of the pixel from the previous frame, transmit a bit (Ej^), 
store its new estimate for that pixel, and then repeat the process for the 
next adjacent pixel in the same frame. The delta modulator will continue to 
encode, and transmit for each adjacent pixel in turn, until the delta modulator 
has been multiplexed through the entire frame (typically 1/30 sec). Then the 
delta modulator will return to the first pixel and repeat the process for the next 
frame. 

Figure 2 shows the hardware implementation of the high speed video delta 
modulator. The arithmetic and logic portions are being assembled out of 
Schottlsy TTL devices. The 200, 000 bit shift registers will either be assembled 
out of 4K dynamic Random Access Memories (RAM’s) such as the Intel 2107 or 
Intel's new 16K Charge Coupled Device (CCD) serial memories. The CCD 
devices are superior to tne RAM’s but the RAM’s may have to be xised if the 
CCD devices cannot be obtained. H the 4K RAM’s are used, spedial multi- 
plexing and demultiplexing lo^c will have to be used with them to make the 
relatively slow 4K RAM’s appear as high speed shift registers. The logic con- 
figuration for the 4K RAM's converted into shift registers is sho'ivn in Fig. 3. 

The addressing logic (not shown) consists of 8 twelve-stage counters counting 
the pulses of lines A, B, G, D, A, B , C, andD. Each counter output feeds 
the address lines of a 4K RAM, 

Construction of the high speed delta modulator without the large shift register 
memory is complete. Testing of the delta modulator has just begun. 



Bo Comyiuter Processor for Color Video Signals 

We are assembling a computer processor for color video signals « The 
processor will enable us to feed color video signals from a color TV camera 
into a computero The computer will be able to process the video signals and 
display a picture on a video monitor. 

The basic system is shown in Fig, 4, An image originatiaating in a 35 
mm slide piojector is focused on a color TV camera. The TV camera has 
three video outputs 5 one for the red component of tlie im^e, one for the green 
component of the image, and the third for the blue component of the image. 

Each of the three video outputs is put into a scan converter which stores a 
single picture frame in its electrostatxc storage tube. Upc n receiving com- 
mands from the computer the scan converter transfers the stored picture into 
the computer. The rate at which the scan converter sends information to the 
computer is controlled by the computer. The computer can also randomly 
access the Information stored in the scan eonverter, i, e, , the computer can 
read the picture elements info its memory in whatever order the computer 
program requires. After the computer proeesses the picture, tne picture is 
read out of the computer and stored on three scan converters, one for the red 
component, one for the green component and one for the blue component of the 
picture. Again the computer controls the rate and order in wMch the picture 
elements are stored in the scan converter. The scan converters are read out 
onto a TV monitor which displays tlie pieture. Since reading of tlie sean con- 
verters is nondestructive and occurs at 30 frames per second, the user of the 
system sees the picture frame frozen pa the TV monitor. The picture eau be 
kept frozen for several minutes and then photographed for a permanent record. 

When the system described above is completed we will have a very flexable 
research tool. By simply ehanglng computer programs the system can perform 
image enliancement, bandwidtli eompression or any other type of image processing 
one -might want to perform on color pictures . 
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L 2. Entropy Encoding of a Delta Modulator Output for Bandwidth Compression 
Ao Abstract 

The output bit stream of the Abate iWode Adaptive Delta Modulator with voice 
soimce input has been studied. It is foimd tliat about 50 per cent of the bits are 
in a steady state sequence, "1100”, when the input voltages are relatively low. 
Hence, the output bit stream can be coded such tliat the final bit rate and band- 
width are reduced. Two Idnds of coding teclmiques, the Huffman coding and run 
length coding, have been considered. The Huffman coding is found to be superior 
to the run length coding. However, the direct Huffman coding algorithm, which is 
an information preserving code, does not result in a high data compression ratio. 
The modified H uffm an coding scheme, an information degrading process, is 
currently being studied. 

Bo Introduction 
1, Historical Background 

Source-encoding or data compression techniques are used to reduce the 
volume of data generated. As a result, the bandwidth of the eliaunel over which 
the data is transmitted is reduced. There are two lands of data compression 
techniques : information preserving and information degrading. When tlie in- 
formation preserving process is used for coding, all of tlie information presented 
before ‘coding can be regenerated. Wlaeras, when the information degrading 
process is used, only part of the original information can be regenerated. How- 
ever, for the information degrading process, a fidelity criterion can be chosen 
such that certain bits of information can be ignored in order to obtain a better 
result in data compression and still be able to retain a relatively high intelligi- ' 
bi^ily. 

Source-encoding systems can be classified as either "variable to block" or 
"block to variable", A system accepting a vaxiable-lengtli sequence of digits 
from the source and generating a fixed-length block code word is called a 
"variable to block" encoder. A system accepting a fix-length block of digits 



9) 


ftrom tlie source and generating a variable- length block code word is called a 
"block to variable" eucodero 

A binary memoryless system generating a statisticallj^ independent source 
alphabet, (0, 1), can be chai’acterized by the probability, P and (1 - P). Pis 
defined as tlie probability of a digit "1" being emitted, and (1 - P) is defined 
as the probability of a digit "0" beii^ emitted. The entropy of the source is 
defined as 


H‘(p) = -PloggP “ (1 - PlloggCl - P). ‘ (1) 

and is a measure of tlie information content of the source. 

If a binary memoryless digital system has n possible values, (x-, x„, , , , ,x 

X a u 

to be encoded with the pi obabilities , (P, , P_, « P ) respectively. Then, 

X ^ tx 

the entropy of tlie source is 

n 

H(p)=- D PjloggP-. (2) 

i=l 

For example, = 00, Xg = 01, = 10, x^ = 11, and P^ = 1/8, Pg = 1/8, 

Pg = l/4, P^ = l/2, Then using Eq. (2), 

H(p) = 1. 75 

In the above example, code words are not assigned to single digits but tobloelcs of 
digits. Di other words, the encoder waits for the source to produce a blodc of 
two digits, then assigns a code word to the two-digit block. 

According to Shannon's noiseless coding tlieorem (1), the minimum average 
code-word length is equal to H(p). The fiieorem is stated as folloivs : Given a 
random variable, x, in a bbiarj'^ system with entropy H(x), there exists a code 
for X whose average code-word length, n , satisfies 
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H(x)< n‘< H (X) + 1, 


(3) 


If a long soxu’ce sequence of N blocks is the input to an optimal source 
encoder, the output ulll be a sequence of NH(p) bits. In the previous example, 
since each bloc]c consists of 2 digits, the total length of the input sequence 
is 2 n bits. The output sequence length is 1. 75N bits. The maximum data 
compression ratio is, then, 

^“1.75N 

The maximum data compression ratio depends on the statistics of the 
encoding source. In other words, once the probabilities of occurrance of 
the different patterns of the source digits are known, the maximum data 
compression ratio is theoretically determined, 

2. Huffman Coding 

The first optimal coding scheme developed was the Huffman coding (2). 

This is a 'block to variable" technique. When Huffman coding is applied 

to a binary memoryless source, the source sequence has to be broken up 

N 

into blocks of N bits long. Then, each block contains one of M = 2 possible 
messages. The least probably message is assigned a code word containing 
the longest sequence of bits. While the more probable messages are assigned 
code words of shorter length. 

3. Run Lengtli Coding 

Run length coding (3, 4) technique is "variable to block". It works well 
when probability P is either very low or very high. In the case of low 
probability P, the number of consecutive zeros are counted and transmitted 
as a block of binary digits. Di the case of liigh probability P, the number of 
consecutive ones are coimted and transmitted. The optimal compression 
ratio as a function of P is sho^vn in Fig, 1. 



Co Theoretical Considerations of Source Encoding of an Abate Mode 
Adaptive Delta Modulator 

For a DC input voltage, tlie Abate Mode Adaptive Delta Modulator has a 
steady state oxtput pattern as shown in Figo 2» It is loiowu that during speeches 
or telephone conversations, there are many short period of pauses. In the 
noiseless case, this pauses will be considered as DC voltage by the delta modu- 
lator and a stream of steady state pattern will be generated. Also when signsd 
voltage and signal frequencies are low, the output bit stream of Adaptive Delta 
Modulator wll be viewed as segments of steady state patterns corrupted by 
some noise bits, as shown in Fig. 3, 

The length and frequency of the steady state pattern depend upon factors, 
such as input signal voltage, minimum, step size of the Adaptive Delta Modula- 
tor, sampling rate, and signal bandwidth. The run length coding scheme will 
be used if the steady state pattern is very long. When the Steady state pattern 
is not very long but its probability of occurrance out of tlie entire bit stream is 
very high, the Huffman coding scheme should be used. 

The steady state pattern is as follows ; 

0 . 0 .,. 1100110011001100 , . . 0 , . 

If a block of 4 is chosen for the Huffman code, tlie steady state pattern can be 
broken up into four different steady state sequences, 1100, 1001, 0110, and 
0011. If a block of 6 is chosen, then the sequence will be llOOll, lOOllO, 
001100, 011001. 

In the next seetion, the experimental results will show that blocks of 4 
are better than bloclm of 6 in the sense that tlie maximum data compression 
ratio is Mgher for blocks of 4. The reason is obviously because the steady 
state pattern is a repetition of a 4 bit sequence. Higher bit blocks will 
drastically increase tlie number of sequences to be coded thus decreasing 
the ratio of steady state sequences to otlier sequences. 

D, Summary of Hesults 
1. Experimental Results 

In order to determine whether the rxm length code or tlie Huffman code 
is better for the Abate Mode Adaptive Delta Modulator, we must Icnow how 



long a run of tlie steady state pattern is. The average run length of the steady 
state pattern for this experiment was ealeulated using a PDP-8 computero 
Figure 4 shows the procedure employed to obtain this statistics o Since it 
is desired to have the computer simulated Adaptive Delta Modulator work 
in real time, the sampling frequency is, therefore, determined by the lengtii 
of the computer program. In other words, after each sample value have 
been transferred to the computer from the A/D converter, the computer 
will go through the program's process of generation the output bit and 
analyzing the statistics before it goes back to take another sample. Since 
the PDP-8 is not a high speed computer, the fastest sampling rate we can 
manage to obtain is about SKHz. However, a sampling rate of 32K is 
standard for Delta Modulators. Therefore, a process of slowing down 
the speed of the source tape is necessary. By alternatively playing back 
and recording at different speed mth two high quaiily.Ampex tape recorder, 

a voice tape of 1/8 of the normal speed was suecessfullj'' made. Tliis 

til 

1/8 speed voice tape was originally part of a magnetic open reel tape 
recorded by the Amp^ Company with Ed Begley reading the storj’^ of 
Mark Twain, • This tape was used as the voice source for all tlie experi- 
ments in this report. 

All of the programs are written in proper length such that the sampling 
rate of 4ICHz becomes equivalent to 32lCHz at normal speed. The digital 

A 

estimated signal, x (It), of the simulated Adaptive Delta Modulator was. Srst 
fed into a D/A converter; then the analog estimated signal x(t) was generated 
and recorded into a tape recorder. This magnetic tape then went through 
the reverse process of slowing down, previously described, to return it 
to the normal speed. Finally, this tape’s intelligibility vjrs determined by 
playiug it back from a recorder. While tlie estimated signal x(t) was being 
recorded, the computer was doing statistical computations and storing the 
results in its memory. The results are compiled for a total of 20 minutes, a 
time interval wMch .we believe is long enough to collect unbiased statistics. 

The average run length of the steaefy state pattern is listed in Table 1. 



Note that the average run length is not very long. A run of 2 or 3 is definitely 
too short for a run length code? .therefore» the use of a run lengtli code is 
ruled out. 


The output bit sequence of the Adaptive Delta Modulator is intriusl'^ 
caly dependent on tlie slope of the input signal. Increasing the minimum 


step size, S^, wall have the same effect as decreasing tlie input signal level. 
The following factors were chosen to perform tlie experiment : 


(1) Voiee S^als are bandlimited to within the range of 300 Hz to 2. 5KHz. 

(2) Sampling frequency f , is adjusted to 32IdiZo Some other different 

' s 

sampling frequencies are also used for comparison purpose. 

(3) The minimum step size, is set to be 40 rav. 

(4) The maximum peaJc to peaiv input voltage level is varied in steps from 


P-P 


to 20 V » 
P-P 


In order to apply the Huffman coding technique to the o utput bit stream 
of the Adaptive Delta Modulater, a computer program is written to search 


for the steady smte sequences. When blocks of 4 bits are used to form the 
code word, the '1100’ sequence was being searched for from the entire out- 
put bit sfa:eam. Whereas, when blocks of 6 bits are used as in the Huffman 
coding block, the ’llOOil' sequence was being searched for. The results 
are shown in Table 2 and Table 3 respeetively. It is clear that blocks of 


4 are better for coding than blocks of 6, and low input voltage level have 
shorter average code word length. It is also found that m the ease pf 


f - 32KHz, S_ == 40 mv, the input voltage level of 2 volts pealc-to-pealc 
s u 

or higher results in veiy high inteiiegibility. An input voltage level of 1 
volt pealc-to-rpe^ or lower results in poor performance due to unfiltered 
in-band ^anular noise. 

By referring to Table 2 , it can be seen tliat at 2 v^^^^ input' signal level, 
53 percent of the total bit stream are in tlie steady state sequenee ;md yet, 
we stiH maintain high intelligibility. These results encourage us to do 
furtlier studies on tlie probability distributions of 16 different sequences. 
Tills data is listed in Table 4. In order to give a better visual compax’ison. 



they are plotted in Mg. 5 through Mg, 9. Prom Mg. 5 and Fig, 6, it can 
be seen tliat the probability of occurrance of steady state sequences (0011, 

0110, 1001, 1100) are much higher than tlie others. This fulfills the 
fundamental requirements of the Huffman coding scheme. It also can be 
seen, by referring to Pig. 7 and Mg. 8, that the probability of the steady 
state sequences are not exceedingly higher than the others when the inpit 
voltage level is increased. In the extreme case of 20 V , as shown in 

p-p 

Fig. 9, the probability of steady state sequences are even less than some 
other sequences. Hence, a number of general conclusions can be drawn 
ttom these results ; 

(1) Blocks of 4 should be used to Construct the H\i£fman code. 

(2) As far as signal to noise ratio is concerned, the performance of the 
Delta Modulator is very good when the input voltage is 2 V or higher. 

p-p 

(3) At low input voltage, the resulting probability distributions Eu*e most 
suitable for coding. 

(4) Prom statements (2) and (3) above, it is clear that the probability 

distribution for 2 ^ input voltage, as referred to in Mg. 6, should 

irjr 

be used for tlie coding scheme. 

2. Information Preserving Coding 

The entropy of the coding source is a measure of the information content 
of the source and is the lower bound of the code word length. Table 5 shows 
the entropy of the source at different input voltages. It was foimd that at an 
input voltage of 2 V the minimum obtainable code word length is 3, 56, 

p— p 

which gives very little data eomiDression. The actual code word length com- 
puted after the code has been constructed, as shown in Fig 10, is 3,598, The 
data compression ratio is 4/3. 598 = 1.112, which is certainly not satisfactory. 
The reasons are twofold : 

(1) The four most probably steady state sequences are coded separately, 
consequently, three digits are needed to code each sequence. However, 
if these 4 most probably sequences can be represented by one sequence 
with the probability 4 times higher, then a single digit is enough to code 
tins sequence and a tremendous savings in code word lengfcli can be expected 

(2) As it can ]je seen, from Fig. 6 and Fig. 10, the probabilities of sequences. 



'OOOO's ’llll'i *0101’, and *1010* are much lower than the other sequences. 

Thus, they can be neglected without sigoifLcantly degrading tlie performance, 

Tlie 6 digit code word will not exist if these 4 sequences are neglected. There- 
fore, a further shortening in code word length can be again ejected. 

3, Information Degrading Coding 

The probability of occurrence of the "0000" sequences is very low as shown 
in Figo 6, The encoder can be designed to receive tliis sequence and to send 
out the more probable sequence, "0100". Similarly, the "1111", "1010" and 
"0101" sequences can be converted into the "1011" or "01 00" sequences as shown 
in Fig. 11, Puthermore, if a sliding block coding algorithm is used , there would 
be only one steady state sequence "1100", with probability higher than 50%. If 
we assume that the 4 steady state sequences could be represented by one sequence, 
the final total number of sequences to he coded would then be reduced to 9. The 
average code word length would thus be reduced to 2, 228, as shown in Fig. 11, 

The performance of this Information degTading process is not yet determined. 

The complete algoritlim describing this process is currently being researched. 
However, the theorelicaliy computed value, 2,228, gives us some valuable in- 
dications that the coding of the Delta Modulator foi' bandwidth compression is 
feasible, 

E, Co nGl usions 

A thorough study of the statistxcs of the output bit stream of the Abate Mode 
Adaptive Delta Modulator has been parformed, A few conelusious can be drawn; 
they are stated as follows ; 

(1) The "bloek-to-variable'’ coding scheme should be used for the Adaptive Delta 
Modulator. The opfchnum block lengtli is four bits. 

(2) The direct Huffman coding technique does not result in a Iiigh data compression 
ratio. Hence, a modified iiiformation-nonpreserving coding scheme based on 
the Huffman method should be derived to meet the theoretical bound. 


Further studies are still to be done in this area# Some theoretical cal- 
culations for the information degrading scheme is needed. Transformation 
methods, such as, Fourier transformation, Hadamard transformation, 
Karhunen-Loeve transformation, and others, are alternate approaches to 
reduce the fi n al output bit rate and channel bandwidth. The video signal 
source using tile Song Video Mode Adaptive Delta Modulator has not been 
analized at the time this report is written. Further ejqpansion from an 
audio source to a video source is the next planned research. 
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Average Number of Buns of Stead|y State Sequences at Different Input Voltages 
Sequences Input Voltages 



1 V 
P-P 

2 V 

P-P 

3 V 

P-P 

4 V 
P-P 

6 V 
P-P 

0011 

4c 81 

2,96 

1.91 

1,86 

1,69 

0110 

4,13 

2.25 

2,24 

1.73 

1,53 

1001 

2c 19 

1,94 

1,86 

1,65 

1,50 

1100 

2o56 

2,21 

2,36 

1, 97 

1.74 

Avergej 

3,42 

2, 34 

2.09 

1,80 

1,62 

Table 2 






The Number of "1100" 

Sequences Encoimtered by Shifiing Along the 

Adaptive Delta Modulator Output Bit Stream (In Unites of Ten Thousands 

Of Bits) 









Input Voltages 



1 V 

2 V 

3 V 

4 V 

6 V 


P-P 

_ P“J>_ 

P-P 

P-P_ 

P-P 

Number of 






"1100" 

89 

66 

55 

50 

41 

Sequence 






Total 






Number 

500 

500 

500 

500 

500 

Of Bits 






Hatio of Bits 






in "1100" 

71% 

53% 

44% 

40% 

33% 


Sequence fo. 
The Total 


Number of Bits 



■IabIa-3- 

The Number of "llGOll" Sequeuce Being Searched (Bloclcs of 6) 


Input Voltages 


1 V„ 


P-P 


2 V 


P-P 


3 V 


4 V 


P-^P 


P-P 


6 V 


P-P 


Number of 

"llOOll*' 

Sequence 

40 

27' 

22 

20 

16 

Total Num- 
ber of bits 

500 

500 

500 

500 

500 

Eatio of Bits 
in "110011” 
Sequence to 

48% 

32% 

26% 

24% • 

19% 


The Total 
Nixmber of Bits 



xable 4 


Probabilities of 16 Possible Sequences at DUIerent Input Voltages 

Input Voltages 


Sequences 

1 V 

_ _ PJ!P 

2 V 
P“P 

3 V 
PrP 

4 V 
P-P 

6 V 
P-P_ 

20 V. 

0000 

0.006 

0.020 

0.022 

0.028 

0.033 



0.035 

0001 

0.035 

0,050 

0,033 

0.037 

0.040 

0.070 

0010 

0.018 

0.042 

0. 077 

0.081 

0.080 

0.088 

0011 

0.245 

0. 162 

0.092 

0.090 

0.080 

0.053 

0100 

0.047 

0.068 

0.048 

0.050 

0.054 

0.105 

0101 

0.004 

0.011 

0.018 

0.022 

0.027 

0.018 

0110 . 

0.243 

0. 158 

0,143 

0^128 

0.111 

0.088 

0111 

0.008 

0. 024 

0.064 

0.066 

0.065 

0.036 

1000 

0.008 

0.022 

0.064 

0.061 

0. 084 

0.053 

1001 

0.133 

0. 132 

0. 118 

0.105 

O.IGO 

0.088 

1010 

0.004 

0.011 

0.018 

0.024 

0. 027 

0. 018 

1011 

0.063 

0.072 

0.050 

0.055 

0. 058 

0. 105 

1100 

0. 125 

0.116 

0.114 

0.103 

0.088 

0.053 

1101 

0.020 

0.044 

0.083 

0.083 

0.081 

0.088 

1110 

0.037 

0.050 

0.037 

0.039 

0.042 

0.070 

1111 

0.004 

0.018 

0,020 

0. 026 

0. 031 

0.035 


Table 5 

Entropies- of the Output Bit Stream of This Abate Mode Adaptive Delta 
Modulator When Blodcs of 4 Are being Used For Coding 

• liq>ut Voltages 

1 V 2 V 3 V 4 V 6 V 

P-P P-P p-p p-p p-p 

Entropy 3*03 3«56 3o78 3o82 3.86 
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Figure 3. When signal voltage and frectuenoies are low, the Abate Mode Adaptive 
Delta Modulator responds with segments of steady state patterns. 



Figure 4, Block diagram ol* experimental procedure 
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0011 

0. 162 

0110 

0, 158 

1001 

0.132. 

1100 

0. 116 

1011 

0.072 

0100 

0. 068 

1110 

0. 050- 

0001 

0.050- 

1101 

0. 044- 

0010 

0. 042- 

0111 

0. 024“ 

1000 

0. 022” 

0000 

0. 020” 

1111 

0. oio- 

1010 

0.011” 

0101 

0 . on- 

Hgure 

10(a). 




0 


0ol40 


0 . 


g86 


TJ7DST 


0* 038 


0.022 




■D:K92 


0.272 




Ti 

X 


■ on)'S4 ° 


0. 044 


0.2281 


0.408 


0. 180 


1.000 


0611; 000 

1101: 1110 

0110: 001 

0010:1111 

1001 ; Oil 

0111; 10100 

1100:100 

1000; 11010 

1011 ; 0100 

0000: 101010 

0100:0101 

1111:101011 

UlO; 1011 

1011:110110 

0001 : 1100 

0101 ; 110111 

Pigiure 10(b). 

The code words 


The average code word length , N=3)((0. 162+0. 158+0.132+0. 116) +4X( 
0, 072+0. 068+0. 050+0. 050+0, 044+0. 042) + 5 X(0, 024+0. 022) + 6 X(0. 020+0. 018+ 
0.011+0.011) = 3.598 
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Received Sequence 

0000 

0101 

1010 

nil 

Output Sequence 

0100 

0100 

1011 

1011 


Figure 11(a). Converting the least probable sequences to more 
probable sequences 


IIGO 

1011 

0100 

1110 

0001 

iiei 

0010 

0111 

1000 



Figure 11(b). Constructing the Hufbnan code 


IlOOi 0 
1011 ; 101 
0100 ; 110 
1110 : 1001 
0001: nil 


1101 : 10100 
0010:10101 
0111:11100 
1000: 11101 


Figure 11 (g). The code words 


The average code word length, N = 1x0.568 + 3x( 0.090 + 0.101 ) + 4x 
( 0. 050 + 0, 050) + 5x( 0. 044 + 0. 042 + 0.024 + 0, 022 ) = 2. 228 



Id 3* Processing and ConyersiQti q£ Delta ModulatioMi Encoded Signnis 


Ao j^erformaace of a Digital Adaptive Delta Modulator 
Id Introduction 

The most general form of a digital Delta Modulator (D]\I) is depicted in Fig. 1. 

Here we have assumed that the irgjut signai, x(t)j is bandUmited to f^ and is 

sampled well alaove the Nyquist rate, i. e, , f » 2f . A general matliematleal 

o xtx 

description of a digital DM is given By the following set of equations s 


e^(Ic) = sgn[ 

|^(k)=xac)-i(k) 


and 

x(lt) =x(k-l) + S^ Oc) , 

where 


S (k) = step size at the k™ interval. 

The particular type of DM is specified by the step size algorithm used to formulate 
S (k). ' We shall be concerned with the Song Algorithm (1) that is used with audio 
signals. For this type of adaptive DM the step size increases linearly and is 
given by: / 

(k) = I \ (k-») I e,, (k‘l) -I- s Jk-2),) 


where 


S = magnitude of the minimum step size, 
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In order to evalxiate the performance of the Song audio mode DM, we shall 
let X (t) be a sinusoid of frequency f and let f be an integral multiple of f » 

tH S III 

Wlienever the sampling frequency is an integral multiple of the sinusoid frequency, 

A 

the DM estimate, x (Ic), will assume a periodic sinusoidal stea^ state pattern# 
Since the estimate is periodic, it can be expressed as a Fourier series and the 
fundamental as well as the hi^er harmonics can be calcidated. We can then 
pass each frequency component tlini a realistic low pass filter and use the 
resulting filtered estimate to calculate the output sigiml-to~noise ratio# 

2« Fourier. Series BepresentatiGU of ihe DM Estimate 

A 

In order to facilitate tlie de3dlvation of the Fourier series for x (k) we shall 
assume that its fundamental frequency is f and not a submultiple of it# If we 

A 

let f g = P f and T = 1/f , then this means tliat x (k) periodically talces on P 

A 

discrete values every T seconds. Using the continuous notation, x (t), the 
Fourier series for tlie DM estimate is 


X (t) = G +S C Gos (277 a f t + (p ) , 
n-1 


where 


C^ = (l/T)/ x(t)dt, 


e =/A +B 


(0 = - arctan (B / A ), 
^n ^ n ' n 


\ = (2/T) f X (t) cos (2ir art) (it. 




^ X (t) sin (277 nf t) dt. 
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Since x (t) takes on P discrete values in a period of T seconds, if we represent 

A 

these values as x., then Eqs, (2e) and {2f) can he written as 
J 

N 


jT/N 

A = (2/T) S X r cos (2irnf t)dt 

“ j =1 •'(j-l)T/N ^ 


(3a) 


and 


N ^ jT/N 

B =(2/T)S X. r sin( 277 nf t)dt. 
n J J , ^ 

J-1 (j-l)T/N 


(3h) 


Using the fact that f^ = 1/T and some trignometric identities, and B^^ reduce to 

N 




(4a) 


and 


N 


S X. sin [ n?r (2j - 1)/N ] , 
J ” 1 


(4b) 


Equations (4a) and (4b) represent the simplest expressions obtainable to 
determines the strength of the Fourier components, G^. It is easy to see 
that the digital adaptive DM as well as this method of obtaining the Fourier 
series for the DM estimate are both readily adaptable for computer simulations 
on almost any digital computer, 

3. Output Signal-to-Noise Batio 

Since we are concerned witli an audio mode DM it would seem reasonable 
to choose a low pass filter that is applieaJ^le to voice signals. A common 
low pass filter is a fourth order Bn-tterworth tj'pe whose magnitude-squared 
transfer function is given as 
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1 H (s)f = l/C 1 + (s/u/ -] (5) 

where 

w = tlie radian cutoff frequen<gr„ 

C 'V 

We are interested in the frequency characteristics of this low pass filter and find them • 
to be 

law | = + (6) 

where . 

f ==to /2if, 
c c' 

To realistiely represent a voice signal we shall choose f^ = 600Hz and 

f a 4 f = 2400Hzo From Eq« (6) we now obtain the attenuation factor, a . 
cm 1 \ ^ 

that must be applied to the Fourier components of x (t) in ox’der to simulate low 
pass filtering, 

= E 1 + (n/4)® . (7) 


Since all harmonics are orthognal we will only be concerned with the attenuation 
produced by the low pass filter and not consider the pliase shift which aJSses. 
After final low pass filtering, the output signal power is seen to be 

(ci Gj) » 


The output noise power comes from ail the frequency harmonics other than the 
fundamental. After the low pass filter the ouiput noise power is expres sable as 


N 


-is 




(8) 


(9) 
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Thus the output signal-to-noise ratio (SNR) is given as 


SNR 

o 





S (O' c )‘ 
n=2 “ “ 


( 10 ) 


4, Computer Simulation 

Thus far we have successfully simulated the Song audio mode DM on a PDF 8/L 

computer employing 8K of memory* We have observed the response to an input 

sinudoid aild have verified that the DM estimate is in fact periodic. In 

addition, the Fourier components have been calculated using Eqs. (4a) i (4b) 

and (2e) and the resulting output signal-to-noise ratio has been determined. 

Naturally we did not use an infinite number of harmonics t' .calculate the 

noise potver as required by Eq. (9), Instead we tnmcated after the ninth har- 

2 

monic because the term was negligible in comparison to the total 

noise due to the second thim ninth harmonies. 

At this time v/e are oonstructing a family of curves depicting output signal- 
to-noise ratio in dB versus relative input signal pov’er also in dB for various 
ratios of hnd for the minimum step size set to unity. We have found that, 

for the same input signal amplitude, the periodic pattern that tlie estimate 
assumes and consequentlj’- the output signal-to-noise ratio is very dependent 
uponihe starting point of the input sinusoid. In Fig. 2 we show the DM 
response to a constant inp\t. Since the DM estimate is periodic with a 
period of 4T , = 4/f , the input sinusoid can start at anj’- point within tliis 
period with equal probability. In Fig. 2 we also show a number of possible 
starting poiivts of the mput sinusoid. In order to obtain a truly representative 
value of output signal-to-noise ratio we are currently averaging the output signal- 
to-noise ratios obtained for 20 different starting points of the input sinusoid. 

B. Direct Arithmetic Processing of Delta Modulation Encoded Signals 

A teelmique for adding and multiplying signals that are DM encoded without 
first converting tliem to a Pulse Code Modulation (PCM) format has success- 
fully been developed. The results of tills investigation were presented as a 
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paper, which is included in the appendix, at the National Telecommunica~ 
tions Conference, December 1-3, 1974, in San Diego, California, 

In this paper we develops both an addition/subtraction algorithm and 
a multiplication algorithm for DM encoded signals. We present the systems 
to realize these algorithms and show that for constant inputs tlie performance 
of these systems is identical to the performance of PCM adders and multiplLerSo 
In additim we display experimental results when elementary signals are used 
as inputs which verify the theory that has been developed, 

At the present time we are attempting to apply the Fourier series theory 
developed above to DM encoded signals which have been subjected to direct 
arithmetic processing. By forming the direct sumand product of sinusoidal 
input sign^s which have first been DM encoded, we will be able to produce 
output signal-to~noise ratios for both the direct sum and the direct product. 

C, DM to PCM Convergioh 
lo Basic Pliilosophy 

Frequently the situation arises where a signal has been DM encoded and 
transmitted but at the receiver we wish to use the signal information in a 
system that requires a PCM encoded signal. Since both DM and PCM are 
digital encoding techniques, we would like to avoid demodulating the DM 
signal into analog form and then recoding it in PGM form. Thus the need 
arises for a direct, all digital method of conversion from DM to PGM, 

The most obvious method to convert from DM to PCM would be to pass 
the DM bits, e (k), thru the DM digital feedback circuit, as shown in Fig, 1 , 

^ - A 

and produce tlie DM estimate, x (k). Since tlie DM operates at a rate much 

A 

higher tlian tlie Nyquist rate, it would be necessary to gate x (Ic) at the 
Ifyquist rate so as to obtain PGM format. The problem with merely using 
DM estimate values that ocoim at the Nyquist rate for otir PCM sample values 

A 

is that w'e run the risk of obtaining a "poor" value of x (k). By Su "poor" value 

A 

of X (Ic) we mean one wMch ejihibits a large variation from the true PCM 
sample value, x(k), A "poor" DM estimate is frequently produced when the 


37 ) 


DM step size has grown too quickly causing the estimate to overshoot the 
true signal. If the true signals continues to increase, the estimate will 
reverse direction for one period, due to the overshoot, and then continue 
to increase. During the one period when the DM has reversed direction, 

A 

the x(t) value is generally a "poor" estimate. When the DM is demodulated 
to an analog signal, the " poor" estimate values are easily averaged out by 
a low pass filter because the DM operating frequency is much higher than 
the Nyquist rate. However, since the PCM samples occur at the N 3 >’quist 
rate, a "poor" value will give rise to a considerable error even after final 
low pass filtering. 

A 

In order to eliminate these "poor" values of x (Ic) , and still maintain a 
completely discrete system, we can insert a digital filter after the DM 
estimate, just before the gating device operating at the Nj'quist rate. The 
digital filter may be viewed as a device which filters in the frequency domain, 
produces a statistical estimate, or performs a digital interpolation. In all 
cases the result is to decrease the out of band noise and make our estimate 
values more accurate. 

2. Nonrecursive Filtering Technique 

The use of a nonrecursive filter to achieve DM to PCM convei’sion was 
first invertigated by D. Goodman (2) who only considered the case of a linear 
DM and used a minimun mean square error design criterion to determine the 
filter eoeffieiettts. In order to complete his design, it was necessary to 
assume input signal statistics. We have dealt with the more general ease 
of any digital adaptive DM, In additon, we use a method to determine filter 
coefficients wbrch is completely independent of input signal statistics. 

The basic DM to PCM conversion scheme for any digital adaptive DM 
first forms the step size, S (k), from the DM bits, e (Ic); then acctimulates 
the step sizes to produce the DM signal estimate, x(l?;). These operations 
are performed at the DM operating frequency , 


f 

s 


- 2Nf 


m 


38) 


where N is a positive integer and the input signal, x(t), is assumed to be 

A 

bandlimited to f , Finally we gate x (Ic) at the PCM or Nyquist frequency. 

m ’ 

f^=2f , 
w m 

n‘ 

to produce the PCM samples, x(Nlc), This scheme is shown in Fig. 3. 

A 

Jh order to eliminate the poor values of x (It) , we can insert a low 
pass filter after the accumulator. Now our converter takes the form of 
the DM step size, S (k), feeding two cascaded linear filters. The accumu- 
lator can be represented as an ideal integrator whose impulse response is 
given as 

a (t) = 1 , t a 0 

= 0 , t< 0. (11) 

Let us designate the impulse response of the nonanticipatory low pass filter 
as 


h (t) , t s 0 

h (t) = 0 , t < 0 . 

In order to complete this converter we must gate the output of the low pass 
filter to yield the improved PCM sample, s (Nit) , In Fig. 4 we present this 
system before the two cascaded filters, a(t) and h(t) , liave been transformed 
to a nonrecursive digital filter. 

Since both a(t) and h(t) represent linear filters, they can be combined into 
one linear filter, g(t) , where 

g(t) — a (t) *h (t), 

00 

a (t^X) h (X) dX. 


(12) 
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The upper limit in EQo (12) becomes t because the accumulator is nonan- 
ticipatory, i«eu , a (t-X) = 0 for X > t, and the lower limit becomes zero 
since the low pass filter is causal, i.e» , h(X) = 0 for X < 0. We also notice 
that for these limits of integration a (t-X) ® 1. Therefore Eq» (12) reduces to 

t 

g(t)=J' h(X)dX, (13) 

o 

and this is merely the unit step response of the low pass filter. 

Now we can express the filtered DM estimate, x(lc) , by the following 
discrete convolution; 

c6 

x(k) S (fc-i) g(i) , (14) 


where 


gO-g^jTg) 


and 


T = 1/f = the DM sampling period,, 
s s 


The lower limit of the sum in Eq, (14) becomes zero because Ihe filter 
g(t) is causal, i.e. , g(j) - 0 for j < 0. Thus we Iiave 


C& • 

x(k)=S S (k-j) g(j),- (IS) 

j=0 * 


Since g(t) represents the unit step response of a low pass filter, we Icnow 


that 



lim g(t) - 1 
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orlimg(j) = 1, 


( 16 ) 


and that there exists a value of j (or t) for whieh g(j) is arbitrarily close to 
1. U we call this value Q, then 


g(j) “1 for all j s Q. (17) 

Using this fact, the filtered DM estimate can be ap proximated veiy closely 
by 

N-1 ® 

X (k)=“D s (IH) g(j) 

j=0 ^ j=N 


If we notice that the second sum can be rewritten, lettir^ k - j = i, as 


OJ j-to 

k-N 


? 

11 

.(i) S^(i), 

(19) 


and recall that the DM estimate is given by 
k 

i(k) =S yi), (20) 

i=_CO 


then wfe can ultimately express the filtered DM estimate as 

N-1 

x(k)- z: S^ (1H) g‘(j) + x(lt-N). (21) 

Li Fig, 5, we give the block dij^ain of this nonreeursive digital filter. 

Our design is now complete except for the choice of the filter coefficients, 



SO)* Bi order to aeliieve the best out of band noise rejection without any 
in band signal deterioration, we choose g(j) to be samples of the unit step 
response of an ideal low pass Mtero Although an analog low pass filter 
is not physically realizable, when a nonrecursive sigital filter is constructed 
we can choose any coefficients desired to simulate a given characteristic. 

Ih Fig, fa' we show tlie unit step response of an ideal low pass filter plotted 
on a normalized abscissa. Analytically g{t) is expressed as 


g{t)=-| + {l/Tr) Si(w^t-K^) 


where 


O' 


Si (ff) = / dx, 


w 5 the radian cutoff frequency, 


K = u T-, 
o e d 


and 


= the time delay of the filter. 
Since g(G) = 0 and Si (a) is an odd funetion, 


Si (K^) = 7t/2 or = 1. 926 radians. 


Jn order to obtain the filter coefficients, we select N points of g(t) in 

the interval 2 w T Thus we set 
c d 

T =2Tj/N, 
s d' 


and %vrite g(lc) as 
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g(k) = i + (I/tt) Si (w^kT^ - ly . (24) 

Usii3g Eq. (23) and tJie fact that = K^/w^, we obtain a final expression 
for the filter ooeffieients i 

g(l^) - 1 + (1 /tt) Si C K ( (2k/N) -1) ] . (25) 

o 

What should be pointed out at tins point is tliat tlie eoefficients obtained from 
Eqo (25) are mtlnn 1% of the values obtained by D. Goodman (2) for the case 
N = 5 which is doeiunented in tliis reference. More significantly we stress 
that these coefficients were derived independent of inpirt signals statistieso 
Presently we are undertalang a computer simulation of this system in 
order to oljtain signal-to-noise ratio curves. 

3, Recursive filtering Technique 

Jf we approach the problem of achieiung DM to PCM conversion from 
a strictlj’- digital system point of view, tlie solution lends itself to the use 

A 

of recursive digital filters. The objection is to relieve x(k) of its "poor" 
estimate values before the PCM samples are gated out, TMs can be 
achieved by a sharp cutoff low pass filter. These characteristics can be 
best obtained mth minimum hardware by using a recursive digital low pass 

A 

filter. The filter is inserted after the accumulator wMch produces x (1<) and 
before the guting device which renders die PCM samples, x (Nlc). 

The recursive filter design teeliniques which have been used are the 
impulse invariant method and the squared-magnitude method. Both design 
procedures are well document by Gold and Rader (3). Currently we are 
undertalang computer simulations to obtain the performance of this system 
with various different types of recursive digital low pass filters. 

Do PCM to DM Conversion 
' 1. Statement of the Problem 

Consider the case where a signal is encoded in PCM format but we vdsh 
to use a digital processing technique that requlres tne signal to be X)M 
encoded. Mow we wish to convert from information arrivin g at the Nyquist 
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rate, 2f^, to a DM form wMch has a frequency of occurence which is N 
times faster. That is, the DM operating frequency is f = 2Nf . hi 

S XQ. 

addition, we would like to confine our conversion technique to all digital 
hardware. 

It is evident that this problem is much more complicated than the 
DM to PCM conversion problem considered in the previous Section. In 
PCM to DM conversion we do not have information about the signal 
excursion between PCM samples and a DM estimate can foUow several 
paths and still pass thru the given PGM samples. Thus we must address 
ourselves to a metliod which first obtains additional sample values betv'een 
the PCM samples and then uses these to choose the correct DM estimate 
path. Since the DM estimate is directly related to its output bit stream for 
a prespeeified digital adapMve DM, we will have them achieved the desired 
conversion. 

2. Submultiple Sampling Technique 

Although there are many approaches to this problem, the soluticn 
presented here is consistent witii previously developed conversion methods 
in that it appeals to a basic theoretiGal principle and resvdts in a system 
which is easily phj^sieally realizable with the current state of the art digital 
hardware. We normally e3q>ect that the best way to demodulate the PCM 
Signal would be to pass it thru an analog lo w pass filter. By doing this, we 
extract all the information between PGM samples. However, we seek only 
a finite number of additional sample values between the PGM points, hi 
order to achieve this, we use a low pass filter which is not only digital but 
which operates in a submultipie sampling mode (4). 

When a digital system operates in tlie submultiple sampling mode it 
means that tlie system produces outputs at a frequency which is an integer 
multiple of the input frequency. This is exactly the cireumstanees that exist 
when we derive additioiml sample values from the PCM samples. To facilitate 
the entire PCM to DM conversion process, we choose to operate the submultiple 
digital low pass filter at the desired DM frequency, f^ = we can 

use these values as the input to the digital adaptive DM that we are employing 
and tiierefore aiitomaticaEy generate the necessary DM bits, e (It). The only 



restriciiori io this teelmi^e is that the DM must operate at a high enough 
foequency so that there is negligible error between the output of the filter 
and the resulting DM estimate. The general block diagram of the PCM to 
DM converter is given in Fig. 7. 

Ih order to tliorouglily describe the submultiple sampling technique that 

we ace invesiigafeiDg, it becomes necessary to dichotomize this technique 

into tn'o types. TJie first type uses the PCM samples as inputs to tlie digital 

low pass filter for only T » 1/f seconds. Li the time between PCM samples, 

s s 

(N-1) T seconds, we Insert zeros or no input; This is analygous to tlie 
s 

situation where the filter input is a series of impulses. The second type 
employs the PCM samples as filter inputs for NT^ seconds. This is similar 
to the PCM samples being held for the Ny^uist period, 

1. PCM Samples with Zeros Enserted 

We shall describe the theory for both cases in a geaei’al fasMcn omitting 
the iDarticular charaGteristics of the low pass filter. Furthermore let us 
start with the ordinary Z - transform of a digital low pass filter represented 
by G(z) and the input and output of this filter specified by X(z) and Y(z) res- 
pscfively. Then as we normalty ejipeet, 


Y(z) = G(z) X(z). 

In addition, we requii’e that the input occxirs once every Nyquist sampling 
period, T^^ = 

If we want the output to occur n times in the Nyquist sampling period and 
the input, which are PCM samples of X(z) , to have zeros between these PCM 
samples, then the output is e:!q)ressed as 

Y(z)^ = G(z)^X(z), 


where 


is referred to as the submultiple sampling Z-transform of the digital 
low pass filter and is realized exactly the same as the ordinary digital 
filter except delay elements are reduced by a factor a and scalars wth 
the constant included are attenuated liy a factor n. This implies tliat 
the digital fQter operates at a freguency and for only one out of every 
n cycles is there an input which is a PCM Sample and not an inserted zero. 
The filter operates on these inputs to produce estimates of the signal between 
PCM samples o 
2o PCM Samples Held 

Ih order to obtain n outputs per Nyquist sampling period for the ease 
when the input PCM sample is maintained as the filter input of all n cycles 
of the Nyquist period, it is necessary to employ a digital hold circuit, H(z) , 
after X(z)» A digital circuit to hold one value as its output for J periods 
when the held value is the input during the first period and there is zero 

^ periods has the transfer function: 

-1 

z 

A block diagram of this J period digital hold circuit is given in Pig. 8. 

If we incorporate the hold circuit into our digital low pass filter operating 
lathe siibmultiple sampling mode tlien we must hold the PCM sample for n 
filter cycles. The desired PGM-held output Is given by 

Yjj(z)^ = G(z)^H(z)^X(z), 


input lor me secGnci mm me J 


J-1 

m = B(z)_ =^ 
Mz) 


where 
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n-1 

H(z) =2 

i=0 


This completes tlie geuerid tlieory for both types of the submultiple sampling 
PCM to DM conversion technique leaving the option to choose whatever digital 
low pass filter characteristics that may be desired and to set the number of 
filter cycles per Nyquist pexiod, n, to be a specific uiunber, N, wMch is 
large enough so that there is negligible error between the output of the filter 
and fhe resiilting DM estimate, 

3* Normalization 

Because the DM can suffer from both slope overload noise and granular 
noise due to an input which is too large or too small, i'espsctively, we should 
talce the precauLlon to insime that our digital low pass filter does net contribute 
to eitlier of these degradations by amplilying or attenuating the input signal. 
This can be accomplished if we normalize the filter transfer fimction in the 
frequency domain. Since z = exp we noi^malize G(z) so tliat at 

ity=0 it will be unit^'-. This normalization applies to the first type considered 
were zeros are inserted between PCM samples. If we call the normalize/? 
low pass filter transfer function, G* (z) , then 

G’ (z) = G(z)/G(w = 0) 
and 

G'(z)^=G(zya(u) = 0). 

Using the normalized transfer function, tile true filter output is 


Y' (z)^=G’ (z)^X(z), 


(30) 


(30a) 


(31) 



For tile type of converter when the PCM samples are held we seek to 
normalize G (z)^ so that at w = 0 this wiU lae imityo Let ns denote the normal- 
ized held transfer function as G" (z)^^ which is tlien given by 

G"(z)_^ = G (2)^/0 (u = 0)jj. 

Now the trtie output bscomes the e^ressiou 
y^'{z)^ = G”(z)^H(z)^X(a).' 

4, gystem Evaluation 

In order to detex'mine the performance of both types of PCM to DM convei’ter 
we are nndertaidng two different approaches. One mil be purely analytical in 
which we letx(z) equal the Z-trausform of a sinusoid of frequency f^ which is 
sampled at the Nyquist rate. After G(z) and n are chosen, we can find the Z- 
transform of the filter outputi talce the inverse Z-transform; and then evaluate 
the mean square error. 

An alternate approach will be to completely simulate each iy'pe of converter 
and the desired sinusoidal input on a digital computer « We can use the Fourier 
analysis technique described earlier to determine the signal-to-noise ratio of 
the DM estimate. This will give us an even more accurate measure of the 
quality of the conversion system. In both approaches we can vary the filter 
characteristies and observe the effect of filter hardware complexity upon 
performance. The latter approach also allows us to vary n and observe the 
dlKerent sj^stem performance in terms of a change in DM estimate signal-to- 
noise ratio. 



References 


(1) Cp L. Soiig, Jo Garodnick and D, L. Schilling, ”A Variable Step Size 
Robust Delta Modulator", IEEE Trans. Commun. Teeh« , vol. COM-19, 
pp. 1033-1044, Dec. 1971. 

(2) Do Jo Goodman, "The Applicatioa of Delta Modulation to Analog-to- PCM 
Encoding", Bell SySo Tech, J. , vol, 48, pp, 321-343, Feb. 1989, 

(3) B. Gold and C. M. Rader, "Digital Ppocessing of Signals", N, Y. s 
McGraw Hill, 1939, pp, 51-70 

(4) Bo C, Kuo, "Aimlysis and Synthesis of Sampled-Data Control Systems", 
Englewood Cliffs, N, J, s Prentice Ball, 1933, pp. 82-86, 



appendix 


49 ) 


Direct Arithmetic Processing Of Delta 
Modulation Encoding Signals 
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Abstract 

In this paper we shov^ that two or more 
Delta Modulation' (DM) encoded signals can 
be added or multiplied without first con- 
verting them to a Pulse Code Modulation 
(PCM) format. These results are obtained 
for a large class of all digital adaptive 
DMs as well as for the basic all- digital 
linear DM. The addition and multiplica- 
tion is performed by operating direotly 
on the DM bit stream and the sura or pro- 
duct signal is presented as either a DM 
bit stream or in a PCM format. 

Bounds are given on the Signal-to-Quanti- 
zation Noise Ratio (SNR) for these DM 
arithmetic operations and compared with 
the results obtained in PCM. Experi- 
mental results are presented v;hich verify 
the theory that has been developed. 


! 


Introduction 

The conventional approach to arithmetic 
signal processing is to first encode the 
signal in Pulse Code Modulation (PGM) 
format and then digitally process the PCM 
signal via standard parallel processing 
techniques, it is becoming increasingly 
popular to encode signals in Delta Modu- 
lation (DM) format, where digital data is 
presented in a serial rather than a 
parallel fashion. If we wish to arith- 
metically process DM encoded signals, it 
becomes necessary to perform the addi- 
tional operation of conversion from DM to 
PCM before processing is initiated. It 
is, however, possible to avoid this addi- 
tional operation of conversion and to 
produce the sum, diffei'cnce and even the 
product of DM encoded nignals by direct 
arithmetic processing of the .serial data. 


J. GARODNIGK ^ 

Goldmark Communications Corp. 
Stamford, Connecticut 


In Fig. 1, we show the basic form of a 
digital DM. Here we have assumed that 
the input signal, x (t) , is bandlimited to 
fm’ and is sampled well above the Nyguist 
rate, i.e., is >> The most general 

mathematical description of a digital DM 
is given by the following set of equa- 
tions : 


e (k) = sgni;4 (k)], 

X X 

(la) 

Ijj(k) = x.(k) - x(k) 

(lb) 

x(k) = x(k-l) + S (k). . 

(IG) 


where 

S^(k) = step size at the k^ Interval. 

The particular type of DM is specified by 
the step size algorithm used. to formulate 
Sjj (k) . For the linear DM, 

S (k) = Se (k-1). ' , (2) 

x' ' X 

where 

S e magnitude of the minimum step size. 

Although there are many step size al- 
gorithms for adaptive DMs, where the step 
size adapts to the input signal power, we 
shall be concerned with the class of DMs 
derived by minimizing a mean square cost 
function, i.e., those described by the 
Song Algorithm [1] . For the case of the 
Song audio mode DM, where the step size 
increases linearly, 

S„(k)= K(k-l)ie (k-1) + Se (k-2), (3) 

and for the Song video mode DM, where the 
step size increases exponentially, 

Sj^(k) = IS^(k-l)|Le^(k-l)+le^(k-l)], 

= 2Se^(k-l), lSj^(k-l)l<2S. 

The special region, |Sx(k-l) 1*^2S, is needed to 
prevent a dead zone where driven 

to zero. 
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ftdclitlon/SubtracLiQn of DM Kncodcd Sinnale 

bwo MliinaViif h(<:) ai\i1 y(Oi bnl.U 
b.iltil t tllll t ml | i> t'ln ai>il i'H| It I’M HI’ 

UihI w<i imly ltH\'it lilt’ H<«|ti<iiiMiitt (K) I 
'mtil(«y(U)l lUM I tiililn, TliiHi oiinit iiliir l.hii 
imililom of obtaiiil.Hii tint mtm of tho/io l;wo 
nl'iiialn. Wo can ijnrfonii airU.limotlc pro- 
ccinninq on{Ox{k>) and {Oy (k) ) and form 
tho dI.rooU imm, an (-.ho mim of tho 

individual, nlqna.1. onklmahont l.o., 

njj(k) = x(k) + y(k). , ('») 

Using tho DM expression for the estimate 
of x(t) and y(t), Eq.(lo)t we can form a 
recursive relationship for the direct sum, 

ajj(k) = aj^(k-l) + S^(k) + Sy(k). (B) 

where Sjj(k) and Sy (k) are formed directly 
from fexlk)} and(ey(k)]. 

The direct sura, (k) , is available in 
PCM format, because of the recursive re- 
lationship by which it was formulated, 
i.e., Eq. (6). To obtain the DM bit 
stream of the sum, {e (k)) , v;e merely 
pass ajj (k> through a digital DM. A block 
diagram showing the structure used to ob- 
tain the sum of the DM encoded signals is ■ 
• presented in Fig. 2. The DM digital 

feedback circuit sliown in this figure is, 
in general, constructed v;ith the appro- 
priate step sire network followed by an 
aceumulator. V?.ith this in mind, v;e 
notice that in order to pliysically 
realize the entire DM direct sum system, 
it requires only a full adder, an ac- 
curaulatGr and the necessary .step size 
network, all time shared. 

To subtract DM encoded signals, we just 
add the negative of the subtrahend signal. 
If we wished to form x(t)-y(t), all v?e need 
do is change +Sy(k) to -Sy (k) in Eq. (.6) 
and the result would be the direct differ- 
■ ence.. Thus, subtraction takes the same 
structure as the addition shown in Fig. 2. 


It is to be noted that the sti'ucture de- 
rived is completely independent of the 
type of DM and is therefore universal for 
any digital DM. In order to realize the 
direct sum of signals encoded by a parti- 
cular type of DM, we must construct the 
circuitry used to formulate the step size 
algorithm employed in the original DM 
encoder. For the modes cited above, the 
step size circuitry can be constructed 
Using the standard digital hardware i.e., 
full adders, delays, sealers and Ex- 
clusive — C/R gates. The dM adder for the 
linear mode and the Song audio mode is 
shown in Figs. 3 and 4, respectively. 


Avcrag -inij 


Till’ Mlm-uly Htal'« initpiuitin nf n1.1. lypuH of 

illgllHl DMm 111 H I'Mitiil mil. lh|Mil. In nii 
Hill, tiiidi It iiigtml v/lilcli Hirlill’IlH n |iHilrii 1 l(i 
pllU.iiril. I''nr II llllH/ir DM I.Iih HHtlliiHl.H l.il 
a fiimi’lo nqiuirn wave with a porlocl of tv/o 
nnmp.llng jntorvalii. For any Dong mode DM 
the poi,'.ibd .Is four sampling intorvaj^n. 

In Fifj . 5 wc show n typical steady state 
oMtim.-ite signal for the .gong audio rnodo. 
In th.I.s figure, tlia constant input,, x, has 
been quantized to and m is a non-nega- 
tive integer. " 

Since the direct sum, aplk), produced by 
a DM adder was formulated as the sura of 
the individual signal estimates, Rq. (5), 
we naturally expect the direct sum to 
also exhibit a periodic pattern when .re- 
sponding to constant inputs. For the 
Song audio mode, there are four possible 
steady state direct sum waveforms; one of 
which is given in Fig. 6. In this figure, 
n and r are both non-negative integers. 
The important property of the four-possible 
steady s-_ate direct sum patterns is that 
•the arithmetic average of any four conse- 
cutive values of ajjCk) is always equal to 
x_ -t y„ . This fact inspired the use of a 
four term non-reeursive filter after ajj (k). 

The four term averaging filter employed 
is described by the follov/ing equation: 


A(k) - i[aj^(k) -t a j^(k-I ) + ap(k-2) + aj^(k-3}]. (7a) 

If we apply Eq. (7a) to the waveform given 
in Fig, 6, the result is 

A(k) = x^ + (7 b) 


for all k, as long as ap (k) has reached 
the steady state. Thus it is seen that 
after a four term averaging filter, the 
DM sum produce.^ the same result obtain- 
able by PCM addition. 

In order to illustrate the function of 
the four terra averaging filter, we form 
the digital transfer function, H (z ) . 
Assuming zero initial conditions and 
taking the Z-transform of Eg. (7a), we 
obtain 

H(z) = -f^,^=’i(l + z*'’' + z“^+z’"^). (Q) 

To display the frequency characteristics ; 
of this filter, we let z = exp(JwT) and 
form 

H(u) = exp(-J3(jT/2) gos(uT/ 2) cos(oT), (9) 
where 

T=l/{^. 

In Fig. 7 we plot (H(w) j = |cos(wT/2) cos(uiT) I 
bn an abscissa normalized to fg . 

From Fig. 7, we observe that H{io) has a 
zero at 1/4 fg , 1/2 fg and 3/4 . It is 
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precisely the zero at 1/4 fg that elimin-* 
ates tiia four sampling interval periodic 
component produced by the Song mode DM, 

WO also note that the four term averaging 
filter exhibits low pans filter character- 
iatics since it slightly attenuates oven 
baseband frequencies. Thus eare must be 
taken in utilizing this filterr because 
it can introduce some distortion to base- 
band signals, . 


Multiplication of DM'Kncodod Sie;nals 


If WG wish to multiply x(t^ and y(t) and 
if we only have the sequences [Oj^Ck)] and 
fSytk)) available, then we can form the 
direct product, Pj,(k), by using the pro- 
duct of the individual signal estimates, 

= x(k)y(k). (10) 

As is the case of the direct sum, we can 
develop a recursive relationship for .the 
direct product, , 

Pd(^) = Pd(^~ V ^ 


+Sx(k)y(k-f) + Sx(k)Sv(k). 


( 11 ) 


The basic block diagram snowing the direct 
product, both in PCM format [PD(k)) and in 
DM format [ep{k)), is given in Pig. 8. 


Since the direct product, p (k) , is formu- 
lated as the product of the^ individual 
signal estimates, as in the case of the 
direct sum, we expect hlic direct product 
to exhibit a periodic pattern when re- 
sponding to constant inputs. Similar to 
the direct sum, the Song audio mode ex- 
hibits four poss-ible steady state wave- 
forms for the direct product. In Fig. S 
we show a typical steady state waveform'. 
The eonstants and c^ depend upon Xg 
and yq and Che amplitude of the steady 
state error pattern (^-Xq and y-yq) , 
while d^ and ^2 depend only on the latter 
of these two- Notice that the arithmetic 
average of any four consecutive values of 
P[j(k) is always equal to Xqyq plus a 
second order terra depending on S^. This 
warrants the use of the following four 
term non-recursive filter after p-^(k); 

P(k) = iljpj 3 (k) + Pj^(k-l) + Pjj(k-2) + Pj^{k?3)J. (i2a) 

Applying Eq. (12a) to the waveform shown 
in Fig. 9, vje see that 

P(k) = xy +i(di+d2)S® (12b) 

Q c} 

for all k, as long as p^ (k) has reached 
the steady state. For any reasonably 
small value of step size, the term 
l/ 2 (^+d 2 )S‘^ is negligible and thus, after 
the four terra averaging filter, the DM 
product yields a resdlt almost identical 
to the PCM product. 


Returning to Fig. 8, we observe that al- 
though the structure derived for the 
direct product is again seen to be uni- 
versal .for any digital DM, it will only 
be useful if the step size algorithm 


is such that we can recursively -realize 
the partial products, that is, the last 
throe terms in Kq. (11). For the linear 
mode this is not difficult, and the direct 
product is I'loen to be 

PdC') = Pn(^- ' ) + ‘'^°y ^ ‘ 

+ SCx(k- ! )y (k- 1 ) + S*ex(k- 1 )ey(k - J ) . 

In Fig. 10 we show the black diagram for 
the DM multiplier for the linear mode. It 
is important tc- -.-'jint out t)iat the ease in' 
realization of the direct product for this 
mode stems from the fact that there are rc> 
non-linear operations involved, only 
simple scaling and niultiplieation by +1 
or -1 . 

In order to recursively realize all three 
partial products for the Song audio mode, 
we must use the following step size re- 
lationship which is a property of all 
types of DMs: 

5^(k)= |S^(k)Ie^(k-l). (14) 

Prom Eq. (3) , v;e see that this property is 
applicable for the Song audio mode as long 
as ISjj(k-l)|^S. Employing tlie step size 
relationship, the partial products for the 
Song audio mode can be expressed as: 

S (k)A(k-l) = Sy(k-l)}^{k-2)e (k-l)ey(k-2) (15) 

■ + Sx(k- l)Sy (k- 1 )ey(k- 1 )Oy (k-2) + ^(k- 1 )Sey (k-2) , 

Sx(k)’^(k- 1 ) = Sx(k- 1 )y{k-2)ex(k- l)Gv(k-2) ( 1 G) 

•^Sy(k-l)Sx(k-l)Gx{k-!)cx(k-2)■^y(k-I)Sex(k-2), 

= |Sx{k- V)Sy(k-l)|ex(k-l 
[ex(k-^)ey(k-2)•^SjSy(k- 
-^S^Cx{k-2)ey(k-2). 

It is easy to see that Eqs. (15), (16) 
and (17) can be constructed using adders, 
delays, scalers and Exciusive-OS gates. 

In Figs. 11, 32 and 13 we show the block 
diagrams of the partial products for the 
Song audio mode as specified in Eqs, (15), 
(16) and (17) respectively. 

We ean also construct a DM adder and 
multiplier for the Song video mode, as 
defined by Eq. (4). They axe similar to 
the Song audio mode aoviees , since the 
step size algoritlim takes on a comparable 
structure. Likewise, the step size re- 
lationship given by Eg. (14) is also ap- 
plicable as can readily be seen from 
Eq. (4). 

It is important to realize that all of 
the systems previously mentioned are ac- 
cumulator or positive feedback type. For 
both the adder and the multiplier, for 
all DM modes, the present output is equal 
to the past output plus additional terms, 
Tl:us it is significant to begin with the 
correct initial condition for the past 
output, or else suffer a constant offset 
error, it is convenient to start with 
both signals, x (t) and y(t), at zero so 
that we ean -employ a zero initial condi- 
tion for the past output. 


Sx(k)Sy(k 

SlSx(k-l) 


)Gy(k-l)-»- 

l)|ey(k-l)ex(k-2) 

(17) 
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Siqnal-To-Woise Ratio 


The siqnnl-to-noiso ratio £or the sum of 
two PCM encoded signals is well-known [2 ) , 
and is given by 

SNRg(PCM) = C(or^2^0yVs^ (18) 

After nonrecursive averaging, the error in 
a DM encoded signal can bo shown to be 
equal to that of PCM, and hence for a con- 
stant input, 


SNR^(DM) = SNR^{PCM). . (19) 

The signal-to-noise ratio of the product 
of tV'o PCH signals is fll^O well-known and 
is given by 

SNRp(rcM) = ‘ 

If the two signals have equal ooV7er and 
the step size small (o * = o ^ = a^»S*), the SNR 
becomes ^ ^ 

SNRp(PCM) 2= Gcr^S^* ' (2013) 

For the direct product of DM encoded sig- 
nals we can develop an expression for the 
signal-to-noise ratio. Again, we include 
the averaging filter introduced in Eg. 
(12a) and the DM error is formulated as 
fp=p.p, (21a) 

where 

P = P(k) = XqYjj + 6S^ (24 b) 


Let us now define the product signal-to- 
noise ratio for DM signals to be 

SNRp(DM) s'^/Vardp). (22) 

Since "p^ = o^Uy , we evaluate 

■^=6S^ (23a) 


6 V, 

and 2 

Var(Cp) - 4p - ^p " 

But this means that 


Var(Cp) = Var(Cp), 


and 


14 4 a ^ 


SNRp(DM) = 


X y. 

2 lals . 


12(a ^+a ‘‘)S“ + S’ 
' X y ' 

2 _ 


Again taking Ojf = Cry = Cf and a »S 
this result becomes 

SNRp{DM)2: 6a^/S^ 


(23 b) 
(23c) 
(23d) 


(24a) 


(24b) 


As in the ease of addition, the conclusion 
is that the performance of the direct PEM 
multiplier followed by the averaging fil- 
ter is identical to the PCM multiplier 
.performance for constant input signals. 


Simulation Results 

A computer simulation has been completed 
for the direct arrithmetic processing of DM 
encoded signals, and results have been ob- 
tained Cor both the direct sum and the 
direeb product employing the Song audio 


mode DM, Both the direct sum and the 
direct product were passed through the nan- 
recursive four term averaging filter men- 
tioned previously. The sum results are 
shown in rigs,. 14 thru Hi and the product results 
are shown in Pigs. 17 thru 19. In all simulation 
results both tlie step size and the sampling period 
have been normalized to unity. 

Fig. 14 Shows the sum of a step function 
and a pulse; Fig. 15 displays the result cf 
adding a step function and a sinusoid; and 
Fig. 16 gives the addition of two sinusoids 
with the same amplitudes and frequencies 

that are ^:n phase, in Figg, 17, 10 and 19 

We show the prodilct of the signals that 
were added in Figs. 14, IS and 16 respec- 
tively. These simulation results verify 
the theory developed for both the DM / 
Direct Sum and the DM direct product. VJe 
mush particularly emphasize the role 
played by the four term averaging filter 
to achieve both sum and product results 
that are so accurate. To fully appreciate 
the effect of this four term averaging, we 
show in Fig. 20 the direct product. of a 
step and a pulse without the four-term 
averaging. Comparing this with Fig. 17, 
which is the result after four term avera- 
ging, clear demonstrates the important 
role played by this filter. 

As a concluding remark, v;e observe an in- 
teresting by-product of the DM multiplier ^ 
In Pig, 21 we show the response of a Song 
audio mode DM to a step of amplitude 150. 
The response time, heeded to reach at 
least 150, is seen bo be 17 sampling 
periods. Notice, from Fig. 18, that for 
the multiplier to reach an amplitude of 150 
it takes only 0 sampling periods. Thus we 
^ave expanded the bandwidth by a factor of 
two, eonsi stent with the previous assump- 
tion of the iriultipliGation process. This 
type of syste:n may be useful when process- 
ing video signals that are eharaeterized 
by abrupt, step-like amplitude variations. 
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NONRECURSIVE DIGITAL FILTER 



Fig. 4. DM To PCM Converter With Filter 
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Fig. 5. Nonreeursive Digital Filter 












li 4. A Pliase Locked Loop with Nonlinear Processor 


Ao latrocIuctLon 

A general configuration for a Digital Phase Locked Loop (DPLL) is shown 

in Figo lo The input signal, f(t), is a bandUmited (B Hz) angle modulated 

waveform (i*e. , FM, PM, FSK, PSK, etc.) of nominal carrier frequency 

f , plus an additive noise signal, n(t). f(lcT ) is the digitized version of 
o s 

f(t), where T is the sampling period, and f ) is the Voltage Controlled 

Oscillator (VCO) waveform in digital format. The Phase Detector (PD) 

extracts the difference frequency tei'm from the product f(kT )«f (kT ). This 

B V 

signal is then hard limited, sealed, and then processed to produce the loop 

outpirt signal, y(ItT ). 

s 

Since in a Phase Locked Loop (PLL) we are interested in extracting the 

phase of the input signal, we can consider the use of a Phase Controlled 

Oscillator (PCO) as a feedback device. The frequency information woxild 

then be the phase difference C ^ (IcT ) - <p (IcT -T ) ] / ATo 

VS V s s 

Let us assimie tliat the form of tlie input signal is some phase modulated 
signal. The digital version of tliis signal (assuming no additive noise is 
present) can be shown to be of tlie form 

fa^Tg) = -2 cos [(k7rTg/2j) + <p^ (IcT^) ] 

where j is an integer greater tlmn zero and modulating 

signal. 

The VCO waveform will be of tlie form 

f, - e C (liT^ ir/23) + (IcT^) 3 > 

where (p^(IiT^) is the estimate of S‘(') is 3- function defined as 

follows ; 
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g(x) =Sq(x) = 


o X < 7T 
7T ^ X < IT 


(3a) 


and 


g(x -{- 2ff) = g(x). 


(3b) 


Also, the VCO phase is given by 
k-1 

- Qyco • ^ 

l=S-0O 

where y(i-^g) is the output of the algoiitlimic proeessor at the time instant 

t = iTg. At this point we wili norJualize to unity for simpUcity unless 
otherwise noted. 

B. Linear Delta Modulator CPhn Type Processer 
1. System Characteristies 

Consider tlie following digital filter to be used as the algoritlimic 
processor. The new estimate, y^c), will be equal to the old estimate 
plus so3ne update data, b(lc), as in a Hnear Delta Modulator (DM), 


y(Ic) =y(lc-l) H-b(k) 

Sgn [ e(lv) ] 

ea^+1) = (f(lc) • g(lc) )^p^ 


In. tlie above, g^ is some scaling factor and lias the following 

connotation. Since g(x) = Sq(x) is a square wave, it can be expanded in a 
Fourier* series to obtain 



(sin X + “ sin 3x + * 


■^(2tiH-i)' 


sin (2n+i) x + • • <>), 


(5a) 
(5 b) 
(5c) 


where n is a positive integer. Therefore the product 
2 cos [ (lev Tg/2j) + (ic) ]• Sq [ (Ic7rTg/2j) + <p^^(ic) ] generates 


( 6 ) 
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harmonics at f/f = G, 2?7/2i, 47f/2j, •*••21 7r/2j, where 1 is an integer, 
s 

If we extract the signal content aroimd f = 0, i. e. , we Low Pass Filter 
(LPF) tliis signal, we obtain the desired ei’ror signa!. This is precisely 
what tlie PD does. 

From Eq. (4) we have 

= (7) 

and so 

(It+1) - 2 <p^(k) + (k-1) = G sgn [ sin (<p^ (lc~l) - (It-1) ) ] (8) 

where G = (4/ff) g, G„ . Equation (8) is a second order difference 

Q VL/v/ 

equation of the loop phase estimate (Ic), Notice that at any sampling 

instant, k, the Eight Hand Side (EHS) is i G. 

Let us assume for a moment that (p = to , a constant phase offset. 

Then, of course, one would Mce <p, (k) to be a cohstaiit approximately 

V 

equal to <p ^ since is an estimate of <p^ (Ic). In particular one would 
want ~ (i^odido 2 tt). Consider Eq. (8) without the sgn function 

present, if <p^ is a constant then the Left Hand Side (LHS) of Eq. (8) 
is zero which implies tiiat sin t <p (I^) - 0^) ] = 0 or tliat 

(p^0«) - <p^0v) = 0 (modulo 2 it) in steady state. However, in the actual 
equation one nollees that if cp^ 0c) is a constant then the LHS of Eq. (8) 

V 

is zero while the HHS is ± G. This is clearly a contradiction^ Instead 

what happens is tlie foHewing. In tlie steady state loeic condition the 

phase estimate (p^ 0c) will oscillate by a fixed amount, A ip , about some 

quiescent value (close to <p ). This can be shown as follows. 

o 

Let the difference bet\veen tlie inpit and VCO pliase be the phase error 

0 Q 0^) , io Ce , 
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We can then rewrite Eq. ( 8) as follows s 

0^ (k+1) - 20^ (1<) + 0 g G^-1) “ -G sgn { sin 0^ (Ic-1) ) + 

t (p^ (Ic+1) - 2(p^ (li) + <p^ (Ic-1) ] 


If (p^ = <p^ as above, the bracketed term in Eq. (10a) is zero, therefore we 
have 

0g G^+1) “ 20g (k) + 0g a^-1) = -G sgn { sin 0 ^ (lc-1) } „ 

Let us assume that a possible solution to the above is 


0^(k) = ~G^ sgn [ sin 0^ (lc-1) ] , 

where 0 < < tt /2o Also assume that 0^ (lc-^1) = G^, so 0^ (Ic) ~ -G^, 

and 0^ (lc+1) = Using the above for 0^ (Ic-l) v;e see from Eq.(lOb) 
that 0 ^ (k+1) =-G-3G^, For 0^ (lc+1) to be +G^ and thus satisfy Eq. (10c), 
G must be equal to -4G^ or G^ = -G/4, So 0 ^ (Ic) will oseillate about 
zero by ± G/4, 

2. Splice Suppression 

Consider the above system operating witli a PCO, The PD will have a 
binary charaeteristic talcing on values of ±lo Let us assume we v;ant to 
track a phase signal or demodulate a phase modulated signal whieh is 
varying linearly with time (i, e, , a frequency offset) . Since we are 
operating in a noisy environment, we would lilce to investigate the 
loop’s spilce suppression capabilities. To do tliis we mil look at the 
loop’s response to a ramp input (a erude appi'oximation of a noise spike).. 


( 9 ) 


(10a) 


(10b) 


(10c) 
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For a PCO in the feedback path the phase error difference equation 
becomes. 


1 + ‘Pjn 

where G = (4 / tt) g , G and G is the PCO gain. Furthermore, let 
us set Gp = S, the normalized step size of the DM (S = S”/T^). 

Let us assume we wish to track a constant phase, (Ic) ~ <p ^ = 0» 
and Ciat at a time instant k = 0 a spike of noise occurs which changes 
^m ~ ^0 ~ ^ ^ linear manner as shown in Fig. 3. (Notice : 

this is an analog representation of the quantized input and is used for 
clarity). Assume initial tliat the system is in steady state, i. e. , the 
loop is traoking the eai’rier with zero frequency error. The PCO phase 
will approximate the input phase and differ by a value less than | S [ , 

At time instant k = 0, the input phase begins to rise at a rate of I'ad/sec 
until it reaches 27T radians, that is, 


Tuk 




9 

9 


k £ 2ir /Tu 


whex'e fxl denotes the least integer greater thanx, 

&nce we do not wish to follow the spilre, the PCO phase should not 
increase by 2tt radians. Instead, since w'e wish to follow the input signal, 
there must be a time (denoted as the "Turnaroimd Time") , IC , for which 
tlie estimate wiU decrease and eventually reloek to (p^ = 0, Due to the 
binary nature of the PD and the use of a linear DM, cp ^ (It) mil increase 
by one step size each time instant until 0 (It) is greater than 77 (Imt less 
than 27 T ), at wMch time cp^ (k) will decrease (Ijpt) = ~1). Thus it is seen 
tliat tilt' PCO phase is a sprite and so the time derivative of the phase is a 
doublet thereby suppresing the spike and impx’oving the SNR (1). For 
k a I? (see Fig. 3) the signal estimate becomes periodic and tracks the 


' ( 11 ) 


( 12 ) 
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oiigiaal sigmil lev jl, = 0» 

The sitiiatiou indicated in Fig; 3 will occur for 7T ^ 0 Gc) <27T when 

o 


k > K so we have 
o 


'<r+<P^ (0) “ S 


< K < 

~ G 


2tt + <p ^ (0) - S 


where > S and <p^ (0) is tlie i:>hase estimate at k = 0, (The reason for 

Tj^ > S is ohidous from Fig. 3, and is equivalent to a slope overload 

condition in an ordinary linear DM. If this condition \vere not so, timn- 

around wotild not occur. ) The above equation specifies the range of 

turnaroiuid time as a hmction of step size and spike rise time. 

It has been shown (2) tliat T^^ (spike) < ^/®XF ^ Ij? ~ 2(3-i-l) 

f is the bandwidth of the IF filter in the receiver front end. This 
m 

factor gives the designer an idea of » a^id upper limit) and so in 

conjunction with some other criteria (e. g. , a minimum step size or turn- 
aroiuid time) he can find the best pai'ameters for his sj^stem, 

C. Song Audio and \Tdeo DM 

Two otlier types of DM processors are preseutljr being investigated 
for use as tlie algoiitlimiG processors. The Song Audio Mode is governed 
by the following set of equations 

y(lj) =yac-l) +Ay pv) 


and 

Ay Gc) « 1 Ay G^-1) j b G^-l) Sb (lc-2) 


(13) 

H- 


(14a) 


(14b) I 


and tlie Song Video Mode by 
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y Qa) ~y (Ic-1) + A y (t^) (15a) 

and 

A y G-) == 1 1 ^ ^ ^ ^ 

^ 2S b (li-1) I 

The Song Audio mode is eharacterized by a quadratle increase in 
step size and tlie Song Video mode by an exponential rise. Proceeding in 
a manner similar to tlie linear DM, we find, for tlie Song Audio mode, 
that 

S K 2 + (s - 2T J K + 2 tt + 2(0 (0) < 0 (15a) 

o R o V 

and 

S K 2 + (s - 2T„) K + 4ir + 2 (p (0) >0. (16b) 

.o' R' o ' ' 

Similarly jor the Video mode we have 

•ir < Tj^K^ + {4S[ 1- (1,5) 1 (0) 1 < 27r, ' (17) 


I Ay ac“l) j ^ 2S 
I Ay (k"l) I < 2S 


Dp Signal Plus Noise 

let us now consider receiving an FM signal plus additive white Gaussian 
noise as in Fig. 1. After IF bandpassing the input signal and obbiining the 
digital format we have 

(1^) = »2 cos C Gw/2j ) + cp^ (ic) T + n (Ic), (18) 

where f^ (Ic) is the jigital informatiGn signal plus digitized noise, n (li). 


If we write n (I^) in its quadrature form as 


n (Iv) = Uj (k) cos (Ic7r/2j) - (Ic) sin (1^7^ / 2 j) 
and assume a high input SNRj we ean write 
fjj = -2 Gos C (Ic;r/2j) + cp^ (Ic) - nj (1^) ] 


1 

where Ug (k) = Ug 0v) /2, 

Using a PCO and a Hnear DM we obtain 


0e = 0e ^e ^ ^ 

+ G^) - (lc-1). 

Letting- H (Ic) = 0 (If) “ n^ (Ic) 

0 

we have 


H (Ic) - II (k-1) = -Gp Sgn E sin H (Ic-^1) 3“ G^) + <P^ G^) “ G<^~1). 

T I 

where n^ (Ic) = Ug G^) “ % G^"*l) ^nd is a Gaussian noise process. For 

~ ^o’ ^ ~ ~ ® ^ we assume that 

the noise samples are index^endent, then under the above conditions we 
see that Eq. (22b) becomes a first order Gauss Markov (discrete) process, 
and so we can api^ly a discrete version of the ChaiDman - Kolmogorov 
equation to determine a steady state pdf, i, e. , the pdf of H (Ic) conditioned 
onHGv“l)(3). The equation for the j)df is 

pic+1 1 =/ ^k pjc I ^o^ 

•wCO- 


where 
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Pi. (H I II ) ~ Piif of H (Ic)- conditioned on H 

iv ' o O 

\ [ z) = transition pdf of H (lc+1) given H (k) = Zo 

Noting that n^.(lc) is independent of H (k), we see from Eqo (22b) that 
the transition pdf (H | z) is Gaussian with 

mean - Ej^ (H | z) = z G sgn (sin z) 

JP ^ 

and 

, 2 2 2 
variance = Var (H | z) = 2 q- ' q; /2 = a , 

Ug S 

where is the variance of n^ and cr 2 iq variance of the original 
noise px*ocess n(Ii), Notice tliat the mean and variance are independent 
of the time paranieter k. Thus we have 

00 

I "" f “ exp (H - z + G sgn (sin z)) 2/2 j J ] ' 

•— OO g IT Q 

• ®lc I 

The range of 0 and therefore of H, tiiat we are interested in is generally 
E -IT, o In Eq. (25) the range is (-•“ > “)® A simple adjustment (4) 
results in the foUow-ing 

^k+i (“ I 'y ==/ ^ (“• \ I H ) a . 

■"77i. 

where 

CO 

• \ 1 Pt (H + 2a7T I H ), 

n=-oo ^ o 


(24a.) 


(24b) 


(25) 


(27a) 
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au(( 


K (H 2 ) = B ”i ^2 exp [ -(H + 2a7T“ z -i- G sga (sin z)V2a 2] 

U= •>{» V "TT GT PS 


g 


(27i3) 


( 28 ) 


so that Pj_ (H [ H^) is periodic modulo 277# 

As k • 4 ®, we look for a steady skite pdf P(H). P(H) would then he obtained as 
tlie solution to the integral equation 

7 T 

P (H) = r K(H, z) P(z) do 

V 21 

'•TT 

The search for an analytic solution of the above as well as immerieal 
amlysis on a digital computer are presently under ^vay. Other items of 
interest to investigate are mean time to slip a C3''cle, location of threshold, 
and tlireshold extension. Tliese problems are being investigated presently. 
Similar stiidies are proposed for the Song Audio and Video processor algorithms. 
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Figure 1. A General DPLL with Nonlinear Processor 







I, 5, The Measuromeat of Li^ht lateusily of Computer Geaerated Pictures 
A. Abstract 

We descx'ibe a metliod of measuring tlie total light energy in a computer 
geaerated picture. TMs method uses Cadmium Sulfide (Cd S) photocells as 
transducers wMch convert the light intensity to an electrical signal. This 
signal is then processed and tlie result is displayed on a DC meter. This 
new measurement technique will pro\dde an efficient tool to aid us in 
optimally recoi’ding computer generated pictures. 


Bo Introduction 

The equipment used in conducting this research is related to both 
communications and imaging systems. The apparatus gives a measure 
of the screen image light energy, which is the output of the video 
communications system. The image is generally discrete in nature, 
the shape of which is approximately Imown. However we want to Icnow 
the light energy so that we ean optimally record the image. In Fig, 1 
we show a block diagram of the overall sj'stem. Figure 2 presents a 
block diagram of the electrical system that was designed and constructed. 

Co Optical System 

A series of eiq^eriments were conducted to determine the optimum 
location of the photocells. It was apparent that the best performance, 
i.e. , sigaai"to-noise ratio, would be obtained by directing the maximum 
amount of light from the Catliode Raj'’ Tube (CRT) on to all the photodiodes 
used. The light had to be directed to the photocells by a method tliat woxild 
not interfere with photographing the image on the CRT sereen. We also 
had to insxu'e Hiat the amoxmt of illumination reaching the photocells was 
independent of the position of the CRT dot and depended only on tlie 
intensity of the dot. It was found tliat placing eight photodiodes in a 
circle aro'-md the edge of the Polaroid camera lens gave satisfactory 
results, A sketeli of this is given in Fig. 3. 
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It was fo\uid that whea tlie eiglit photodiodes were eoanected in parallel 
their total resistance varied from 500K with maxinaiun illumination from 
the CRT to 5M in total darkness. This implies tliat each photocell has a 
resistance of 0 (5001^) = 4M v/ith maxirnmn illimiination. Emplojdng the 
photocell manufacturer’s curve displaying cell resistance versus the 
illumination, we were able to extrapolate to find tliat the illumination 
per cell was 0, 005 footcandles® 


Do The Detection and Measurement System 

The oplicai information obtained from the photodetectox’s is processed 
as follows, Opei’ational Amplifier (OA) 1 (see Fig, 4) is a buffer to trans- 
form the light intensity, which is proportional to the photocell resistance, 
into an electrical signal, m(t). During the time interval T^. (see Fig. 6), 
OA 2 integrates m(t) as follows : 


v(t) - 



I 


T < 


m (T) dr. 


w'here 


Tj.= the integi’ation period (switch open), 

.T ~ 1/E C = the integratioxx time constant 
o ' o o 

and 


T = the scanning pemod for one video frame. 

Just prior to the dump time, T^^ (see Fig, 6), switch Sj^ closes and 
a final reachng of v(t) is taken and presented to the input of OA6. The 
output of OA6 is then displayed on a DC meter. When ^ opens, closes 
and dumps tlie accumulated value of v(t). tlien opens and the cycle 
repeats during the next scanning intexval. The synchronization circuit 
of Fig. 5 generates the required eontrol signals to operate switches 
and Sj^. 



79 ) 


E, Overall Perforjnance 

The desig'aed circtiits were found to function reliably imdei* different screen 
inteasilies and complex illxmiination patterns. In addition, tlie frequency 
characteristics of our processing and synclironization circuits are sucli that 
we can operate over a wide range of frame scanning time wiih little change 
in performance. Thus we are able to obtain a sufficiently good measure 
of the light energy in a video frame as it appears on our scanner. Using 
this information to aid in setting the exposure of the Polaroid camera we 
are able to more precisely record a video frame. 
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Pig. -1 . The overall system, 





Fig. 2,. Block diagram of the electrical system. 
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II, 1 Testability Enhancement in Digital System Design 
C, S. Chuang and S. J, Oh 


Digital system applications demand high capability to 
insure the correct operation of a system, Rapid real-time fault 
detection is essential to satisfy this aspect of the system performance 
criteria. 

Most of the existing techniques to detect the stuck type 
failures require, in general, an application of long input sequences 
generated by a hard core computer according to a* complex algorithm. 

On the other hand, v?ith the recent advances in semiconductor technology, 
the complexity of circuits fabricated on a single LSI chip tends to 
increase rapidly. The diagnosis of systems utilizing LSI components 
poses an extremely difficult problem because of the structural 
complexity of the components and their limited accessibility. 

The problem of augmenting' the testability of a digital 
system becomes the ''•^portant design criteria. Several approaches to 
enchance the testability of digital system by means of redundant 
hard'vvare have been developed recently. But, they are still not 
efficient enough to provide real time fault detection for stuck at 
fault model. 

One approach to satisfy this i* equir ement is to include the 
fault detecting capability of the system from the initial stage of the 
system design. New teclmiques of employing hardware redundancy to 
reduce the number of tests to detect the stuck at faults are proposed, 
so that the criteria of real time fault deteetion will be possible for a 
digital system. - 

Some works have been done for tMs effort, two papers have 
been published recently. The results show two tests are enough for 
stuck at fault detection of any eombinational circuit implemented vdth 


the proposed PLM's and CSI's, and four tests are sufficient for 
detecting stack at fault of the combinational logic portion and single 
permanent malfunction of the flip-flops of a synchronotis sequential 
circuit implemented with the proposed PLM's, CSi's and CMM's. 
Also, the test patterns are automatically generated by the systematic 
design procedure. 



88 ) 
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DESIGN OF EASILY DETECTAULE COMBmATIONAL 
AND SVNCiiRONOUS SEQUENTIAL CmCUlXS’^ 

Ovln Slicns Chunng nnd Bo Joung Gh 
, DQpnrtiiu'nt of Elootricnl Engliworlng 

Tho City Collogo of The City Unlvoralty of- Now York 
Now York, Now York 


* Abatrnct 

Tho problem to roiUuxj llio number of tonta for fault clotcotlon in both combinational loglo 
clrcuUa and Bynchronoitn Bcquontlnl circulta la Invoallgatod, UllllzlnR programmable lo- 
gic moclulu and controllable mcmoi-y modulo, ayatomatlc dcslirn and dotoctlon proccdtiroa 
nro described. 'I' wo loala arc aufficiont to detect nliy Btuck tj'pc faults of combinational 
ch’CulL, and four tcata arc neccfionry and sufficient to detect any aluck typo faults of ele- 
mentary logic gales and the malfunction of fUp-fiope of synchronouo ooqucntlal circuit 
ualngD fllp-rinpn. 


I', INTItOnUCTlON 

Digital syatem appUcatlona demand high reliability which 
implies an increased capnbllHy to a.ssure tho correct 
aystem operation. Rapid real time detection of stuck 
faults la often casentlal to Insure the Integrity of the sys- 
tem performance. Gno aspect of the system to meet 
these pcrformaaco criteria la to Imbed tlic fault detec- 
ting capahlUly within the syslcm from the Initial stage 
of system design. 

The existing teciinf<iucs [l]^Dl] to detect the stuck Laulls 
require, in genor.al, applic.atlon of long Inpnt sequences 
generated by .a hard core comimlcr according to a com- 
plex algorltlim. On tiic other hand, due to tho recent 
advances In .semiconductor technology, the complexity 
of rifculls faliricatr.d on stnglc LSI eliip tends to Incrcaee 
rapidly. The funcUonal dliigno.-ils of syalcma utilizing 
LSI Gbmijonents po.sca a difficult problem. 

TcchnleiuoH t-l}, [h] were Invcntlgaicd Invarieua dlrec- 
llons to sinipVlfj tiie f.-nilt detection by u.sing hardware 
rcdundanc}'. A new aiiprbach is propoaed hero which 
udll allow us lo |x!rform fault detection in leal time. 

Tlic following dHinlUons are Ininxlucodt 

(1) Input .ScnuiUvIty: Tho ocn.'iRIvlty of an Input paltom 
for a logic gate in defined no l!>o number of Indlvlciiial 
Inputs In the pnitern capable of ncnalng failures in tho 
Kate. ,• 


For example, (00) Is the moot senoftivo input pattern 
for two-input OR gate olnco any one of tho input ntuclc- 
Bt-i will affect its output. The most oonsllivc Input 
pattern for the N-input OR gate is 00. ..o'(N O's). By 
extending tlits idea, the most nensL’-iVo input patterns for 
the ordinary combinational logic galea are cotablislicd 
ns flliown In Tahie-1. 

(2) Sensitizing Patlu A path which propagates the fault 
information to tiic observable outputs. 

“ The stuclc type faults can be partitioned into two cla.saon 
by tlie logic function nature of tho elementary gates. 

(3) Type I Fault: Tho fault set consisting of sluck-iit-l 
(s-n-l) fnuKfl of the inputs of OR, NOR and the output of 
^'.^ND, and tho sUiek-at-0 (s-a-0) faults of the inputs of 
AND, NAND and tlie output of NOR. 

The complement of Tyjw 1 fault is defined as Typo IT 
fault and they form tl)o complete class of stuck ty[Xj 
faults. 

It is fouiiil that tlic Tyixi I fault information in a co.nbln-, 
ntlonnl logic circuit can always propagate to the obacr- 
■ vablo outputs of the circuit if and only If tho inputs for 
all gates arc tlie most sensitive bipul patterns for the 
rcQpoctlvo gnlns. 

(•1) Program mab'io Logie Modulo (PLM)s A tnuUlplo 
input -ninglo output combinational circuit la defined to bo 
n pro gram inn bio legio modulo If each of tlw logic func- 
tions porformoci by Uw module can bo uniquely siveciflod 


* This work was supported in part by NASA under tho Contract NA89-13940. . ^ ' 
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by n fi«t of control niRnaln. 

(0) Control Kli;n:il Invorlor (C.fll)t An KN<‘bitit<x> ORriiIo 
I n wlilrti ont> tif Inputn In nju'd mt lovot oontml utiinnt. 
Tho nivolflc loi;Ii! ftinotlon of l’l,M for ANn.OU, NAND 
nuit NOll itnli'M is itosrrttk'il In Tnblo-il. TIumv nro 
inanj’ xvnys to Imploinonl n itt'shx'it I’tjM. KI|;. 1 nliown 
one of Itio Imploon'ntntlou of t’t.Ms for two-lniml i;iitofii 
TIu' Konoritl n'lHX'Hontdtion of l’I,Mi5 Is-nliowii li'i Fl(;.2, 
The cl rcufl- const ructed l)y tlio iisn of PLMs :ind CSIn 
bits three moilea of operation! NOllMAf- inode, TEST 1 
mode ami TEST 2 mode. If we clnnole one clnsa of 
stuck type fiiulls wbiclj can Ixi defected by 'I’EST 1 mode 
ns Typo I fault, then this class Is equivalent to the 
Type II fault of TEST 2 mode. Similarly, the Type II 
fault of TEST 1 mode is equivalent to the Type I fault of 
TEST 2 mode and It can bo detected l)y TEST 2 mode. 

So the complete clas.s of .stuck type faults can be detec- 
ted by setUnst tlie circuit to TEST 1 and TEST 2 modes. 
For slRnlc stuck type fault as.sumptlon (S-a-l,s-a-0), 
any combinational circuit with prlmiiiy output observa- 
ble can be constructed with PEMs and CSls so ns to de- 
tect the stuck typo faults with two tests Inpul: patterns 

[dJ. 

(0) Controllnbla Memory Moiiulc (CMM): A memo' 
element with direct set and ro.scl c;ip:d>iUty whose out- 
puts arc convoriablc by two oxlcnial control alicmtln. 

A pcnornl muilel of CMM In shown In Its out- 

liutii nre <11 ami qj (whore qi-y|'i>C,^,j , qi yi'l'Cy|), and 
Cyj/Cjj dcpcnd.s on the control sli'iiiils of It.s sucec-ssor 
PLM and dfrect set and reset signals of the Cllp-fiop. 
Gonsider a Delay flip-flop CMM a.s sliowni in Fig. 4. 

Type I malfunction Is defined as the failure of a Delay 
CMM which falls to propaRate 0 rs'hen D=0. Fig. 4 (a) 
shows that the s-a-1 of D, 5 ',q and Cy are equivalent to 
Type I miilfunctlon. Type II malfunction is defined as 
the failure of a Delay CMM which falls to propagate 1 
when D=l. Fig.<l(lj) allows tliat flic s-a-0 of D,y,q and 
Cy are cqutvjilcnt to Type II malfunction. It is suffi- 
cient to detect the mnlfuncLion of CMM for fault detec- 
tion of the flip-flops portion of a scque.nUal circuit. 

The relation of control signals betwee.n the CGnsecutlvo 
gales are shown In Table-lII. Cyj/Cy. of CMM Is set 
according to Table-IV. Syalematic fleelgn and cleloctlon 
procedures are developed. Two c.'cainplos are used to 
Illustrate tlie procedures. 

II. COMBINATIONAL CTUCUIT DESIGN 

Consider the boolean funellon f-b{n'+e'd)+(lxi) ' (n Mc'd) ' 
as shown in Fig. 5 (a). Us controliablQ elrcuil is con- 
Blructcd as shown in Fig. 5 (Ij). All tlic control slgnala 
C|, Cj, C, art! net to he 0 during the normal o(x;rntlon. 
For test oimraUon, wo apply the two tests neoording to 
the detection proccilure na follows: 

TEST 1i Set C,CjC,=0U fl’EST 1 mode) and apply 
input lent pattern nbcd^OtOI., If the observable outputs! 
(-1, then the circuit does not have atuck fault of Tyjia 1. 
Gllionvisc, It han at least one ntuck fault of Typo 1. 
TEST 2; Set C,C,C3=101 (TEST 2 mode) and apply 


Injnit tost pnllorn nbcd^lOlO, If the olisorvablo output 
f!-0, then Ihn circuit <tin*n not hnvo ntuck fnult of Typo II, 
OMuirwIso, it halt at Inanl one ntuck fnult <if Tyjio II. 
Onco Iho (dreult jimin ihmm lwr» touts, tho circuit In 
guaranUHid to bo fixm of sluck iil-l mul nluck-nt-O fmilLa, 

• III. eYNCllIl()N()ll.S SEQUENTIAL CIIICUIT WITH 

D-FLIF-1’L01^> 

Consider a two bit shift right/shift left rcgletor ns 
shown ill Fig. C(a) in which Xj is the serial Input for 
shift loft, Xj Is tho serial input for shift right and Xj 
is the shift control. It is converted to an easily test- 
able circuit as shown In Fig. C(b), where is the 
monitoring c._iput. For normaToper.atlon 'wc set all 
tho control sljpinla to be 0, For testing, four tests are 
applied according to the detection procedure as follows! 

TEST l! (S(jR^=10): Set CiCjCjCy C =01100 aixl 
apply input tost pattern Xj X 2 XjkIW. ^Observe the 
output Zm, If Zm=l, then tho combinational logic does 
not have stuck fault of Type I. Otherwise, It has at 
least one. 

■ TEST 2: (S,|R£] =00) ! Observe the output Zy Zy , 
if Zy Zy^ =00, then the flip-flops do not have ihe ^ 

TYPE II malfunction. The disagreement Implies tho 
corresponding filp-flop has the Typo II malfunction. 

TEST a: (B() ]\| =01): Sot C, Cj C, Cy-j Cy^ =10100, nnd 
njiply Input tost psitturn Xj X 2 Xj^OOl. Obuei-vo tlio out- 
put Zjy^, If Zjj^=0, then the combln.-itlonnl logic does not 
have stuck fault of Typo H. Otherwise, it has at least 
. one. 

TEST 4 ! (Sdlld=00); Oh.serve the output Zyj Zy , 

• If Zyj Zy =11, then the flip-flops do i;ol have the 
Ty)>e I malfunction. The disagreement Implies the 
corresponding flip-flop has the Type I miilfunction. 

Once tho circuit passes these tests, it Is guaranteed 
to lie free of stuck at 1 and 0 faults of logic gates nnd 
the malfimctlon of flip-flops. Notice that the Exclusive 
OB unit In tho CMM la not necessary for this siiecial 
circuit. 

rv. CONCLUSION 

TliQ iitMlwntion of FLMs, CMMa nnd CSIs nssures that 
two tests can detect iiny combinational circuit nnd four 
tests, for any synchronous scqucntinl circuit using 
D-flip-flopa. In addition, tho test pattern can be onally 
genorntod by a syslninatlc procedure so as to allow 
fault (ictcctlon in real time. 
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Tablo-II, Tour basic programmable logic modulee. 
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Fig, 1, An implementation of four baoie 2-lnput PlMn 
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Fig. Z. The general model of PLM and symbols 
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Gate 

Its Ruccennor 

Tho Relation of Their Control Signal 

OR 

OH/NOR 

KAMI! 

"and/nand 

D1!='FERENT 

AND 

01?/N0N 

DIFFERENT 

AND/NAND 

SAME 

NOR 

OR/NO R 

DIFFERENT 

AND/NAND 

SAME 

KAND 



ON/NOR 

SAME 

Ara/NAN^^ 

DIFFERENT 


fable-III. The general rule of determining the PLM 
control signal settings. 


page IS 

aUALlTY 


INPUT 

(J,K) 

{S,R). 

(D) 

Vs) 



7K — : 

-"c: 


Clock 
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The Siicce&sor of The State Variable 
of T)ie Flip-Flop (q^j^ or ^i ) 

1 

S^Rd= 10 

SgRg= 01 

PLM^ or PLI/i^ 

Gyi=C^ " 

Cy.= Cj 



PLMg or PLM^ 


Gy.= Cj 




* Cj is the control signal of PLMs 
Table-IV-. Control signal of CMM witii respect to the pre- 
determined control signal of its successor PLMs, 



(a) (b) 


Fig. 4. (a) Type I malfunction of Delay 
(b) Typo II malfunction of Delay 






Type I f aults=[a^ ‘ '^G ' ' ^0 ‘ ‘ . Iq * , f ^ 

Type II fault a=[[aQ > * Cq i d^t e 1 5u , i^, , k^, 1^ Pj^s f^^j 

Where r.^ ~ the line x stuek-at-k fk- 0 or 1) 

Fig» 5* Combinatienal circuit design example. 

(a) Original circuit, (b) Easily testable circuit. 



Typo 1 
Tjps II 

Fig. 6. Synchronous sequential circuit design example. 

(a) Original circuit, (b) busily testable circuit. 
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Cliln Slii'iii: CluintiR timl Ko .Iniiir nil 
DciiiirlrnpnL of IliocCrlonl l•:tl(^llu'(^!•|n^' 

Tho Glly CollcRo of Tlio Clly Utilvornlly of Now York 
Now York, Now York 


ANSTItACT 

Srvrral nuprondicn to onhniu'o llio loKinhlllty of 
S.vstoni Iw monns of mliimlmit liiinhviirc !i;ivo keen pro- 
posed veeentty. A new w;iy of cinjiloylni' hnnlwnro 
rcdiinflnney to reiluro llio lutnilior of IcKln for f.’uilt dclcc- 
Uon in boUi comblnnllonal amt aynelironoiis (;c(|iionl.lal cir- 
cuits is Invesll|;alct1. An npproacii Is prcsonlod for ntlll- 
zliiU ayslenv.'lln rediiiulaney to simplify tieslim work. 

Motlol.n f(>r Pl.M (prnitramm.alilo !o[:lc module) and CMM 
fconlronalvlo niemory miidiile) are deplctetl. Kyslematic 
(to.s|Rn and didectlon procednre.s are described. .Itslnfr • 
these procedures, an ea.slly le.stalite e.lreuR (for niUGk . 
fmilts) e:m be destpnixl. In eonlvasl to the onrller resnUa 
on fault defi'cllon In lo(Ne elrnill/'"”’' two fe.sts aro noodod 
to dct.eel.any sUiek fault.s of comb liiiit Iona! toido cire.ult. 
Four lest.s are necessary and sufflelenl to detect any stuck 
rnulls of clement a jy lopic pales and the inalfiitictlon of 
nil>-noi).s of symehrnnons sequonUai circuit using Delay 
flip-flops or Trigger nip- flops. 

I. GOMNINATIONA L N'KTWOIIK 

If a logic gale can not )>erfonn llic desired logic func- 
tion with re.spec.t to tho Input p.-itlern applied, then there 
exists come pennanent (aidls In the logic (;ato. These per- 
manent faults arc defined an r.Inck fauil. The stuck fault 
can bo ciiiegorir.cd Into two types, lhat Is, stuck at logic I 
(B-n-I) and .stuck at logic 0 (n-a-n). 


Con.sldor a two-input AND gate). There aro four 
possible binary Input pattema (do, 01, 1.0, 111. Rome In- 
put fallure;i will not cause the output to change, 1. c. , they 
■ arc mn.nkod. For oxaraplc, If no Is unccl as the Inputs to 
• ' an AND g.atc, the output will be affected only when both Ln- 
puts Gtuck at 1, and all the other poc.siblc stuck failures 
! will be masked, \Vc Introduce tho scnsitlvliy of an Input 
pattern to a gate to cla.sRlfy tho pattern nceording to its 
ability to avoid fault masking. 

■I 

The nensllIvUy of an Input pnllcm to a gato Is doflncd 
na the mimlier of Individual Inpulo In tho pattern capable of 
Dcnalng failures In the g.-iLo. 


If wo ajiply the Input pattern 01 to .an AND gale, then 
only the lu]iid "0" .slnclc-at-l will affect the output. If wo 
apply Ihc Input pattern II to tho AND gate, then any or 
both "1" irUick-at-0 will catino tho change of output. Hence 
the moat sennitivo Input pattern for tho AND gate Is 11. Wo 
find that Ihe moat scn.slMvn Injiul palleni for an N-lniasL AND 
gale Is II. . . I (N I's) by estcndlng tho definition. Tho most 
ncn.sUlvc Input palti-'nia for ordinary 'OU and NOU gntoa are 
all Kcro Input, and for ordinary AND and NAND gate are ; 
all one Input. For N.vclusivo-OR and Eqnlvnienco gntoa, , 
nil tho possible Input pattemo have equal ochalUvlty, 


* This work was lUippniied in part ly NASA-IIounlon under 
the NAHA Contract NAJt B-13040, . . ,. 

' ■' y •< • I' 


original page N 

OP POOR QUALIIY 



A palli Is defined as Bennlllr.Ing pnlh If Iho fault Infoi-mn- 
Hon e.nn pmpagnlc along the path to the obnorvablo mitputfi, 
Tho ;iluek lyqie r.anll of any combinational circuit can bo 
paiHItioned Into two ilistlnc! classes according to the logic 
function performed by the clcmenlai'y gato. One clann la 
tho stiick-al-1 (B-a-ll faults of fan-ln of an OH or NOIt 
gate .ami fan-out of a NAND gato and the atuck-al-O fo-n-Oi 
fanltB of fan-ln of an AND or NAND gate ami fan-out of a 
NOrt gale. Tliis clasB la (Icnolcd aa Type I faullo. Another 
class Is the complement of Tyqio I faults and la denoted ns 
Typo 17 faiittn. Wo can then conclude the following theorem. 

‘ / 

Theorem 1 ' 

Tho Typo I fault Information In a combinational logic 
cire.ult always propagates to Iho obnorvablc oiitpuls of tho 
circuit If and only If tho circuit topoUny lo arranged In 
such a way that tho Inputo for all gaten are the moat Bonat- 
tlvo input patterns for the respective gates. 

I 

Proof 

1, Sufficient Condition If wo use the moat oonBitlvo 
Injmt patlorns as tiio tcist Inputs, then any otuck at 1 or </ 
fault of Typo I faults will alter tho output of tho gale. Thia 
W'Ml in turn affect tlic output of tho Bubsequ cat gntoa con- 
tiQctod to It by ch.aln process, and the fault Information of 
Type I will eventually propagnto to tho obooiwablo outputs, ■ 

2, Noe OB .s ary Conflltlon Since tlio most sensUlvo In- 
put patterns will not mask any eet of fault Infox-matlcn la 
the Tjt° 7 faults, tlio necessity is obvious. 

In tho Colknvlng discussion, let tho lino x stuck at tho 

logic value k be denoted as x (k=0 or 1). 

K 

Dx.'implo 1 

Consider the circuit showTi in Fig. 1. The circuit ' 
anllsfios the conditions of Lhoorem I beennse It la posalblo 
to find n test Input pattern such lhat inputs to every gate 
arc most nonslLlvc. Wlicn wo apply an Input pattern 
nbcdcfg'-ll 01001 to this circuit smd observe K=1 In rusponao 
to UiG liiiiut, then the circuit docs not h.ave Tjqio I faults, 
where Type I rnultc= ja^, b^, c^, d^, c^, g^, I^, 

r f 

L, k^. Ip, z On the other hand, If z=0, tiicro exists / 
at least ono Tyne I fault. Slniilnrly, If a new circuit Is ’ 
genoratod by Intcrcliiuiging OR to AND, and NOR to HAND, 
or vJco versa In the circuit of Fig. 1, wo can detoct the 
compUnv.ent of Typo I fnnlts of tho original clrouU from tho 
now circuit with tho compleinont of tho input pattern used 
for dctcetlng tho Ty^io I faults In the original circuit. Tldn 
strucUirnl conversion for testing can ho porformod by meann 
of the progrnmrnablo logic moclulos (PLM). 



' lh'3 


94 ) 


tj_ >j. 




f- 

E" 


ORIGINAL PAGE IS 

— OF, POOR QUAIOT 

I y |j 

-4:>. ■ 

- 

iL> 


Fie. 1 


A imiltlpto InjiuL-ahiRlo mit|nit comljinalinual cli’cuil 1 b 
dcflnctl lei hi>. a pfoitiMimuiiliIo lopk- mneUilci If t'iU'h of Ihe 
loKic fuiK’Uoiis (uTfnniif'd by llie.- iiioeUili' cem I»es unlviuuly 
ope:clfi«l by n aot of control slmials. 


The spocJflc IorIc rimulion.s of PLM for AND, OR, HAND 
nnd NOR Rnlcs arc flcscrlbctl In Table T. There arc numer- 
ous ways lo Iinplcrncnl the dcsireci I’DM. Nig, 2 kIiowb one 
of the lniplonienln>' Ion of PLM’s for hvo-Iniiiit and ono-ouLr- 
put RaloB. The gcnorhl representnUons of PLM’a are 
shown in Fig. .1. 




u 

ll/.V 


Module 

Desired l.ogic Function When: 
C=0 C=l 

IM.M 1 

AMD 

OR 

I'T.M 2 

OH 

AND 

I’LM 3 

NANI) 

NOU 

IM .M <1 

NOU 

NAND 


Tabic I 



PLM 3 PLM 4 


Fig. 3 

Corollary 

Any stncic fault Information In the combinational logic 
network will propagate to the observable outputs If and only 
If the structure of the network aatlstlcs theorem I and all 
the gates arc intorebanged with the suitable typos of pro- 
grammable logic modules. 

proof 

The detection of Type I faults la obvious. The uttllv.atlon 
of programmable logic modules allows us to derive a simple 
test pattoin for tlie detection of the complement of Tyjie I 
f.aults by setting the control signals for the PI/M's. In other 
words, any stuck type faults can be detected by a single In- 
put pattern and its complement. 

The circuit constnteted by the PLMs has throe circuit 
modes, 1. c. , NORMAL mode, T15ST 1 mode, TEST 2 mode. 
If we detiole the clas.s of stuck type f.ault that can bo delected 
by TEST 1 mode as Type I faults, then It Is the Type II 
faults of TEST 2 mode, so It can be detected under TEST 2 
mode. This implies that the complete stuck fault class will 
be detected under TEST 1 mode and TEST 2 mode. In the 
following, we let C^G2=00 indicate KORMAL mode, Cj^C^=01 

indicate TEST 1 mode and C, C =10 Indicate TEST 2 mode, 

1 ^ S 

where C^Ggate the control signal of its corresponding s 

PLMs. Notice that the faults at control signal are equtva^ 
lent to the corresponding fault class at testing. 

■ \ 

Esnmple II j . 

Gonnidcr the combinational circuit for z = 

(x.jc )*(X X )' an shown In Fig. 4 (a). Wo use the PLM3 in 

' J 4 w O 

place of each NAND gate and connect the control slipialn ns 
shown in Fig. 4 (b). For normal operation. Betting C,| 0^=00 
(normal mode) , all the PLM2‘s function as HAND gntcii. 






l^or li’stlni';, wv BC’l C' C -lO (TP.RT 2 moilc) m«l(x x x k )-r 

L ^ 1. A *1 'I 

(OOOIU, consequently, nil the Rales In oiUl level hocoino NOT! 
nml nil the Rates in even level lieeume NANO. If we 
then Rnow Hint there exists at least one Typo It fault. Then 
we set C C^-ot (TEST I nioflo) nml (x,x„x, .s Now 

nil (he Rates In odd level become NANO' s ami .all the Rates 
in even level NOU's. If there exists nt least one Typo I 
fault. We olisci'vc Ih.at the tost input .sequence Is short ami 
easy to protiucc If the rembinatlonal network la constnicted 
to s.itlsfy the conditions of Thcoroin I, 






(b) 

FIr. d 


Exee|it for n I'cslrleted clas.s of ci reulla such as the one 
lllusLi’alcx! In Example HI, we can eonvei’t n circuit to a 
speeiflod .‘itnieture which will satisfy the retiuiremcnt of 
Theorem I durinRlhc testinR liy projicrly .sctllnR the control 
Blgnal.s of PLMa'. The choice of control signal sottinRS for 
H given circuit is therefore important. The relation of con- 
trol signals between tiic consecutive gates are established 
as shown in Taljlc II based on Theorem 1 and Its corollary. 


Gate • 

li.s .Suceoasor 

KeJatiem of Their 
Gontrol Signal 

OK 

OR/NOR 

.Sanic 

AND/ NAND 

Differ ont 

AND 

OR/ NOR 

■ Different 

and/ NAND 

.Same 

NOR 

OR /NOR 

Different 

AND/ NAND ' 

.Same 

NAND 

OR/ NOR 

Sainc 

AND/ NAND. \ Different ' | 


Table II 

Example Til 

Gon.sidcr the liooican expression z-x’y'-' xy as sliown in 
Fig. 5 (a). It Is impossible to construct the cli-oult to satis- 
fy the conneet ion requirement by the technique of setting 
control .slRnals. 'J'he. only way to solve this prdilotn is to 
apply a conlroltcd signal invertor (CSI) at the proper points 
In the Glrcviit to make the elreuit aatirify the cotnimons of 
Thcoi’cm I. Tlie CSI is aotunlly an Exclusive Oil gate In 
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which one of Hie Injait is used as level c'lntr l signal. So the 
circuit can be rcconatnictcd by vmlng PLM's and CSI as 
sliovu In riR, 5 (b). 

For normal operation, we act C.C C =000, For leating, 
wo set C,G„C =011 with (xyl^fOO), and then set C,G C.=101 

1 <: »1 X 2 3 

with (xy)=(ll). 




(b) 


Fig. 5 


In our Bubnequimt discussion, an input variable of the 
circuit will 1)0 said to be a prlniniT input if the variable Is 
included in the minimal expression of at least one obserV'nble 
chxujt output. We then conclude the following corollary. 

Gorollarv II ' 

Given tiny combinational circuit, If each olisorvnblo out- 
put of the circuit can be represented ns an explicit switching 
function of circuit lnput.s, and caclt circuit Input variable Is 
a primary Input, then the circuit c.an be constructed with 
PLM’s and CSI Bo ns to detect the stuclr faults with two test 
input patterns. 

n. SYSTEMATIC DESiGN PHOGEDURE 
Fiai GOMHINATIONAL CHtCUlT 

The following definitions arc introduced to fiscilitate the 
discussion of the systematic design procedure. . • 


Predecessor gate 


A gate Gj is called the predecessor of a gate G if the 
gate output of G is one of the inputs of Hie gate G..^ On the 
other hiuid, G^ Is the successor of Gj. ^ 

Level of gate G 


Let the level of gate G be denoted ns L(G). Then L(G)=1 
If the gnte output of G is tlie primary outimt (observable). 
Otherwise, L(G)= the maximum level of the successors of 
O -n. • 

Gonslder the circuit shown in Fig, G, where Z Z are 
1 12 
primary’ output. Then L(Gg)=L(G^)=l bocmiso their gate 

outputs are primary outputs, L<G )=L(G )+l=2 and L(G,)’-= 

max [l( 0„), i-l=2+l=?i according to Oio definition, 

11/3 













(a) 



(b) 

Fig. .6 


An nlRorllhm of (he nyRtcmatlc tlciUjfn procedure for the 
conthinaUonal circuit In «! from the property of the 

proposed PLM'B and the Introtluced dcflnitlona. Tlio flow 
chart iH shown In Tig. 7 nnd the clctalVn of the procedure are 
described below. 

• Systematic design procedure 

• Step 1 Find otit the level of each gate of a given combina- 
tional circuit nnd reconstruct the circuit to a controllable 
‘circuit by using PLMl, PLM2, PLldtl and PLM4 to replace 
the AND, OR, NAND and NOIl gate reBiJectivoly. Denote the 
control signal of the predecessor as CP and the logic value 
of the primary Input as PI and the roforcnco control signal 
ns ns. 


St ep 2 Let 1=^1 and J^tho maximum level of the circuit 
and the control signal of PIjM1/PLm; 1 In lovel-i bo Cjand 
that of PLM2/PLM4 In levcl-1 be C , where C^is the com- 
plement of Cj. ; / 

Step 3 Plclc one input of lev.el-I which is not picked yet 
nnd truce back to its predeoesBor, If the module from which 
the input 1b picked is PLM1/PLM3, go to step 4. If the 
module from which the Input 1b picked is VIM2/PIMA, go 
to step 10, 



IwUftt Ir. ttift. prl^^ennor 7 


to V ■ ■ |t.,ot nSs CS of ihifl nindutfl ,1 
... 1 ^ — tr— |l» d^tlsfliiina.nr niit ?1 — 


PLM27PliMl Pl/Ml/PLKM Pflma.ry Input PL.MZ/PL.M5 Pi.Ml/PLM4 Primary Input 

J1 J _I_ _i j 

[>= iisiLXkS cp= I 


|cprj(sJ7J 


NO 


| cp= ruSIl -diS IpiOiS’k ^S? iLtt CPt R.sil It-Bt CP»l^ Ibet Pjt KS.I 


YESl 




JvyKAt tcTho prodioceaBbr ?) 


t 

\ 




U/.1 


I 






stop •! I.cl Ilf? Ijo cqtml (o Uio control nii;nnl of tliln 
PIjMI/I’LMH inotUtlo. Then check to nee wliothor Iho control 
slitnnl of its prcOocoafior lo tlcfinai or not. If yon, go to 
slop S. Olhenviso, go lo step 7. 

Step 5 If the prcduccosor Isi 

(1) PLM2/PLM;i, check "la CP=RS?" If yes, go 
lo step 6; olhcinvlsc go to step 

(2) PLMl/PUI-l, check "Is CP=^KS7" If yes, go 
to step 8; olhenviso go lo step C, 

(3) Prlinnry inpul., chock "Is Pl^nS?" If yes, go 
lo step C; othonviso, go to -step 8. 

Step s Insert a GSI In series with this input. Then go to 
step 8. 

Step 7 If thc.i)rctlcco8sor !s! 

(!) PLM2/PLM3, net CP to bo tho complement of 

ns. . 

(2) PLM1/PLM4, set CP Co be equal to ns, 

(3) Primary Input, set PI to be the complement 
of ns. Tlierv go to step 8, 

Stop 8 Cheek to see whether each Input of the lebel-I is 
picked or not. If yes, go to step J); oChenvlse go back to 
step 3. 

Stop 3 Cheek "Is I Larger than J ?" If yes, tho proce- 
dures are completed; othcnvlse let 1=1+1 then go back to 
step 3. 

Step 10 Let ns be equal to the control signal of this 
PLM2/PLM'l modulo. Then cheek to see v/hether the control 
signal of its predeeessor.ls defined or not. If yes, go to 
step 11. Otherwise, go to step 12. 

Step 11 If tlic predecessor Is: 

(1) PLM2/PLM3, check "Is CP=RS?" If yes, go 
to step 8; othenvise go to step 6. 

(2) PLM1/PLM4, check "Is GP=RS?" If ye j , go 
to step 6; otherwise go lo step 8. 

t 

(3) Primary input, check "Is PI-R.8 7" If yes, go 

to stop 8; otherwise go lo step 6. • ‘ 

Step iz If the prccicecssor Is: ' ‘ 

(1) PLM2/P.LM3, set cJp to be equal to RS. 

(2) PLMl/PU\14, set CP to be the complement • 

of RS. 

(3) Primary input, set PI to bo equal to RS. Then 

go to step a. f 

V i ‘ 

Fault clolcctlon procedure 

Pmtil dolccUoh pi-oeofls for tlie trnivslntcd clrcitli, la easy 
to follow. Tho IcBl Input patlorn In nlroi^cly gencroted by too 
denign procedure. Two tests can detect all otuclc faults 
(Typo I and Typo i 


Teat 1 Sol C CjjC - Oil and apply the corrcnpondlng lent 
input paficm. li the observable output of PLM1/PLM2 is 1 
and that of PLM3/PLM4 is 0, then the circuit does not have 
any Typo I fault. Olhonvlac, It has nt least one. 

Teat 2 Set C. C^C =101 and apply Ihc corresponding teat 
input p,ittoi-n. Iflhc nbscrvjiblo output of PLM 1/PLM2 io 0 
and that of PLM.'i/PLM4 ta 1, then tho circuit does not have 
any Tjto II fjiult, Othorwiao, It has at Icaat one. 


Once the circuit passes theso two tests, it has guaranteed 
to be free of stuck faults, then set C C C =000 to per/oi'm 
lie normal operation. 12 3 , 




Fig. 8 


Exarmrlc IV 


Consider the circuit showr In Fig. 8 (a), where z. \ are 
the observable prim.ary outputs. We first defiire the level of 
each gate according to the definition, then the circuit is con- 
verted to .1 easily detectable circuit as shown in Fig. 8 fb) by- 
applying the systematic procedure. The label for cadi line 
Is used to define the T^^jo I fault and T3'pe II fault. Tho input 
test pattern (abcdefg)=(C„C 2 G e c c C ) is generated. Two 
tests are applied ns 'follows: . 


Test 1 Set C, C„C-=011 and apply the test pattern . 
(abcdcfg)=(ll00lll). If 7,2=11, then the circuit does n.ot 
I ave any Typo I fault, where Type I fault = [a^, b^, o^, d^^, 

®o* ^0’ S' ’’o* ^0’ ^0’ *'l‘ *'o' ^10’ ®2ol. Other- 

wise, it has at least one- 


TOat 2 Sot Cj^OgC^^lOl and apply too tost pattern 
(nbcdofg)=(O0ll000), If circuit does not 

have any Typo II fault, whore Typo II fault= [hj^, bj^, **0' 
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Cj. fj. ij. 3^. r. 2 jl. Olhcr- 

wJsc, it lins at least one. * 


m. FAl'l.T Aioni-tl, or MKMnitV lyt.KMKN'T (FITP-raX)!^ 

Kat'li nip-riop lias its own Inith l:il>lc. To Vtolernilne the 
neeuvney of fliiv-flop, it Is not noeessary (o find out wiuire 
Is wronc: insJiU' flic fliii-flop. The iniivirtant !IK 1 jc;cL is to 
IvTiow wlu’tlier llio fliji-nop does fimetion properly accorcilnR 
lo its desired tnilli table or not. If the flip-flop can not 
function corroclly for Gcriain excitation, then we say that 
the fllp-llop has malfunctkiii. 'I’ltn malfuiietinn of flifi-flop 
can he cliissiflcd, In iiuncrnl, into four cnLcRorlcn: 

1. IVire A ninlfimellon Is the flip- flop falls to rcmnln 
nt Its previous stal e. 



2, Type 11 iM!il function is the flip-flop falls to change 
Its previous stale. 

2, 'type C inalfiinetlon 5s the flip-flop falls to reset to 
0 stato. 

4,. Type ll in al function l .s ttic flii>-nop falls to act to 1 
state. 

Consider a D-fliii-flop, H will only have Type C and D 
jnalfunclinns. 'I'ype C inalfimcUon is caused iiy tlie failure 
of failing to reset to 0 state wiien D-O, Type D malfunction 
is caused by iho falluro of failing to sot to 1 slate when !>-' 
Tlio 'i’-flip-riop will only have Type A r.nd Ti inalfunGtions 
according to its truth table. The .IK-flip-flop will have :ill 
of the four pOGsildc nial functions. Tf the flip-flop docs not 
have tlio malfunotinna deaeribed above, then it is ensured 
to funetion properly under all the possiI)lc inputs unless 
there exists a intermittent (tr ansient) fault in.side the flip- 
flop. 


IV. SYNCHIiONQUS SRQl'El-nTAL NETWORK 


The difference Ijetweon sequential Gircuit and combina- 
tional circuit i.s tli.it tl'.c sfxiucntia! circuit contains memoi’y 
elements in addition to the combiuationnl logic. In a so- . 
qucnllal circnil, the present outjnit.s depend on the previous 
slates and/or prc.scnt primary inputs, and the state tran- 
sllimi.s depend on tiio previous states and primary inputs of 
the Cireult. Con.seqncntly, present fault may not affeet the 
present outputs. Hence we ciuinot in general cletoot the 
faults by a singie test pattern. It requires a sequence of 
test pattern to detect the faults in a sequential circuit and 
the length of ttie test sequence is proportional to the number ‘ 
of ineinorj' cieiuent.s in the circuit. There arc several 
lechiiifiues to reditco the length of tiic test sequence by means 
of addition a! control logic clreuitry. hi the following dis- 
cussion, we introriueo a new aspect of control Circuitry to 
detect the f.auKs in synchronous sequential circultB. In 
addition to Hie PI.^Ps, wc propo.se llio controllable memory 
modules for the incnioi"y olenioiits as follows; 

A memory elcinonl with direct set and reset capability 
wiioso outputs arc ennvcrtiijlc by two external control sig- 
nals Is called the coiiLrollablc memory modulo (CMM), 


A general model of CMM is shown in Fig. 9. If the flip- 
• ftop Is DFF (of TFF, JKFF, SHFF, etc.), the Input Is D/ 

(or T, JK, SR, etc.) and the module Is denoted as DCMM 
(or TCMM, JiCCMM, SRCMM, etc.). Its outputs arc q^ 

and cjj (where qj-yj'i’Cy^, qj-yjfli Cy^) and Cy^/Cy^ depends 

on Hie control signals of its successor PLM and direct set 
and reset signals of the flip-flop, mid Zy^ and zy^ are ob- 
servable outputs. 

The CMM will operate like tbc pure flip-flop unit In the 
CMM wiien the control signals arc set to o. For testing, we 
set Its control signals according lo Table III. The failure of 
CMM will i:e contributed by tho faults of its control porlion 
and the flip-flop unit. In the (ollowing disciussion, the fault 
In CMM is I'estrlcted to the single permanent fault. 


Succes.Bor of the Flip-Flop 

S , R , = 1 0 
d a 

Vd-°^ 

PLM 1/PLM 3 

Cy . = C . 
1 } 

Cy.=c/ 

Cy ,= C . 
» J 

Cy.^C. 

1 y 

PLM 2/ PLM 4 

lu 

It 

>. 

U 

Cy . = C . 
X y 

Cy. = C . 
1 J 

cy , = C . 

’ X J 


C^is 


Table III 

the control signal of PtiM. 


Consider the TCMM as sheuii In Fig. 10, The Tjqjs A 
malfunction can be caused by T^or (wiien the module l.s 

previously reset) or Cy^ <Qq) the equivalent 

fault. It can be delected by Hrst directly reset the module 
and then apply T=0 and Gy=0 (Cy-t). Tyjie B malfunction 
can be caused by or y^^ (when the module Is previously 

reset) or Cj'^ ^ Cduivalent fault. It can 

be dc'toctccl by first directly reset the ntp-flop and then apply 
T=1 and Cy=0 (Cy=l), The malfunctlon.s of JKCMM can bo 
detected by the similar concept. So the malfunctions of CMM 
can be detected by eurtaln Input pattern. The apiilication of 
using CMM lo enhance the tcutnbility of synchronous ooquen- 
. Uni circuit is lllustr,nto.d by the following example. 
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n n.viu'tirommii Mniity comilor {m|>lo- 

l\v T--fll|i-nui( ;m ithmvn i‘i I’lt!. 1 1 (it). » In li-iitm- 
roi’imtl lo Uu' t'lrciiU iia iilutwn In I'li;. 11 (ti) l>y mtbnUUitlnix 
Ow5 onllnarj- itntcs mill 'ivnip-ntips v4lli 1 It u proper PLMs 
and TCMMs lo ctilinnco Jls tcs(nbllil.y, where mi nddlUonnl 
Tnodtilo 1(1 iii.'iorlod to jnonltor tlio Iteliiivlor of caniblnntlonnl 
lople iwiHon and'z lo lUo inonlbivliii; (uil)ml. AH llio 
conlrol nli’imla nri> wt lo 0 for iionimt nporiiUon. Tor 
tcatbifi, four toHl-o «rc npjilloil nn followm 

Teat 1 (S^n^j"10) Set Cj=(i nml Cy^-0 for 1=0, 1,2, 3. 

W Utcrc axlsts no Type I fruit, whore 'I’j’po I 

fauUs= b^, c^, e^], Othci-wlnc the fault Informa- 

tion win be sensed nutl clrtvcc z to 0, 

m 

Test 2 (SjU^=00) Lot x=l and liccj) the control slipnnls 

unchanjiKl, If TCMM has type B 

IT , JfuncUon. Otheiuvlse, at least one TGMM has T^ipe B 
raalfuiiOlion. , ' 

Test 3 Set e^=l and Gy^=0 for 1=0, 1,. 2, 3. 

If then there exists no Type B fault, wltcre Type n 

faults= bj^, Cj^, d^, , Otherwise the fault Informa- 

tion of Tjiio II will force 7. to 1. 

• m 

Test 4 (S^B^jfOO) Let x=0 and beep the control sljjnalB 

unchanKcd. If 0000, tlicn no TCldk has 'I'jim A 

innlfunction. Othorwlao, at least one TCMM has Type A 
malfunction. 

As long as the circuit passes tlieso four tests, then Uie 
logic galcR arc guar-untced to be free of stuck f.sultSj and the 
TCMMs arc guaranteed to function properly, li. Is wortli lo 
point out the fact that the Exclusive Oil unit in TCMM is not 
requited In most of Uio sliifl register and counter clrcnUs 
which arc implemented by using TCMM, it is also true for 
DCMM, It is li«)xirtjmt to note lhat tbo procedure proposed 
horn iu Independent of tljc number of flip-flops 'and gateo In 
the circuit under test. 

V. CflNCbUSieN 

The application i>f I'Ltds, CMhlR are proposed wlilch will 
transform a circuit to ccrlala topclogical r.f ruiilure lo aug- 
ment the tcslnbillly Ilf the circuit. Two tests can detect any 
coinbjnalloiml cireuil with ebae.rvablo oulpuln. Systemntio 
clcniipi and deicetlon proccduro for coniblnatlfmaJ network nni 
prctirnlwL Eoui'icnln are miffltienl for fauH detection of 
iijmchmiuiun acqucnUnl circuit im|>lcmented wlUi T*-fllp- 



(b) 

Fig. 11 


flops or D-filp-flops. The same concept can be estchded 
for njTiohi'onous sequential circuit using JK-fl ip- flop or 
ER-flip-flop, and for sjTichronous seciucntiiil cireuiL using 
mixed tjqie of flip-flops. Systematic design and (.Icteelion 
procedure of any synchronous Ecquentlal circuit is more 
complicated than that of combinational circuit and r<!qujrcn 
further investigation. 
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II, 2 Parallelism Exploitation in Parallel Computing Systems 

To Hsu and S, J. Oh 


II, 2- 1 0 Introduction j 

This report summarizes the result of an investigation on I 

* i 

parallelism exploitation in parallel computing systems. The basic need I 

for using such a system is to increase the processing speed for real- | 

time application, without excessively increasing the cost or control | 

complesdty of the .system, ! 

1 

The discussions will center on three major topics, I 

■ ' ' 1 

1) Parallelism detection • j 

2) Task scheduling ! 

3) System configuration 

11,2-2, Parallelism Detection I 

i 

Parallelism refers to the fact that two or more subprocesses j 

! 

of a computing task may be done simultaneousiy, ‘ A necessary and I 

’ ■ I 

sufficient condition for this to be true is that the input parameters of I 

one process are not functions of the output parameters of another, and j 

vice versa. Under tMs definition, although not always separable, i 

J 

there are two basic types of parallelism in most computing problems: 

1) Algorithmic parallelism.; This is the type of parallelism 
where many independent but similar or identical computations 
are done as required by the computing algorithms. The 

parallel algorithm GOuld be a direct adaptation of a serial \ 

' i 

algorithm, or could be one created mainly for efficient j 

parallel processing. Typical examples are matrix 
multiplication, array data processing, computation of ’ 

fast Fourier transforms, etc, AlgorithmiG parallelism is 
reflexed in sequential programs (such as Fortran) by DO 
loops, although careful distinction should be made between 
the recursive type (where input data are updated as indices j 

are changed), and the non-recursive (or array) type, 1 
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2) Operational parallelisms This is the type of parallelism 
in which, although not always evident, exist several sub- 
computations which either bear no data transfer relations, 
or are not affected by the precedence order of their executions* 
It is therefore possible to detect these sub computations 
and make them concurrent operations* This type of 
parallelism generally exists in any sequential programs, 
both between statements and Within a statement. Good 
detection algorithms are essential for a)* designing 
efficient compilers for conventional uniprocessor systems, 
and b). generating parallel subtasks suitable for concur rent 
operations in parallel processing systems. 


II* 2-3* Task Scheduling 

While the parallelism detection algorithm may generate all 
possible concurrent processes regardless of how they can actually be 
utilized by the system, it is the function of the scheduling mechanism, 
either by software or hardware, to properly match these two together 
and result in an optimal or near- optimal performanee in terms of 
computation time, which is usually the criterion for the measurement 
of scheduling strategies. 

Principle of scheduling can be applied at different levels of 
a computational process, from program segmentation in a multiprocessor 
system, to the execution of microinstructions by several arithmetic 
units. With slight variations, each process can be propeidy modeled. 

In a parallel processinf^ system. We are specially interested in the 
■ scheduling of tasks at the machine instruction level, and a deterministic 
model in the form of a job graph is generally used. Each task node 
represents ah instruction execution, all linked by the preGedence 


e 



relations j and the node weight corresponds to its execution time- length. 

Several formal scheduling techniques exist, although most 
of them are both complicated and lengthy for practical application 
except for some simpler special cases like: 

1) when the graph is a tree and all nodes have equal weights. 

2) when there are only two processors to be scheduled and 
all tasks have equal weights. 

For more general problems, experimenta’ results have 
shown that optimal or near- optimal results can often be obtained by 
using much simpler heuristic algorithms. Two of the most prominent 
ones are; 

1) Critical path scheduling: for the nodes waiting to be 
processed, those on the critical path are schediiled first 
in accordance with the nximber of processors available, 

2) Largest-processing-time algorithm for independent tasks: 
tasks are processed in the order of decreasing task weights. 


II. 2-4 System GonfiguratiGn 

While the eonventional uniproGessor system operates on single- 
instruetion single- data stream (SISD), parallel processing and 
multiprocessor systems have structures to exploit single-instruction 
stream multiple-data stream (MIMD), Specialized systems under the 
names of array processors, pipeJme processors, and associative 
processors, etc. are variations of the SIMD type, while a number of 
imiprocessors interconneeted under various schemes belong to the 
MIMD type. Data processing, data access, and data alignment play 
major roles in the determination of system Gonfiguration, Existing SIMD 
systems have up to hundreds of processing units while MIMD systems 
are limited to a parallelism of 16 or so. - 

II. 2- 5 Conclusions 

We conclude tMs report \vith some eornments; 

i) Efficient parallel algorithms, which in many cases may not 
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be just simple adaptation of the serial counterparts, have the 
gross impact on parallel computing, and operational parallelism 
aids to speed up the computation at the lower level. 

2) In practical and real-time applications, simple 
scheduling techniques prove to be satisfactory. Formal 
algorithms can be used to establish the optimal standard 
for comparison, 

3) It is desirable to have a system's capability to reconfigure 
for computing purposes, but the increased problems seem 
quite prohibitive. Since a close relation between the computing 
algorithm and the system capability is vital for efficient 
processing, it appears that a basic need exists to identify 

the class of problems for application and base the system 
configuration on the most applicable and efficient algorithms. 

The advancement of microprocessor application may also 
point to this direction. 



II. 3 An E»:perimental Simultaneous Multiprocessor Organization 

G. S. Mersten and S. J. Oh 


The purpose of this research effort is to develop the basic 
requirements of a highly parallel information processing system and 
the specific development and implementation of an advanced 
experirriental computer organization concept entitled, "^mult^neous 
Multiprocessor Orgriiuzatio^', abreviated ’’SAMSON". Some of the 
areas to be studied encompass: 

Parallelism and parallel processors, 

SISD stream processors, * 

SIMD stream processors, 

MIMD stream processors and, 

MISD stream processors. 

The primary effort to date has been the development and 
implementation of one of the uniprocessors of the .SAMSON computer 
system,. This uniprocessor will be used at this time to collect data 
to determine the preliminary SAMSON configuration. 

The SAMSON uniprocessor is a general purpose, parallel 
digital central processor providing a full parallel sixteen bit 
arithmetic structure utilizing multiple general purpose accumulators 
and a microprogrammed control unit. 

The Arithmetic Unit of the SAMSON uniprocessor provides 
the capability to perform" arithmetic and logical operations on the 
various machine registers and memory. The information from the 
general purpose accumulators, memory, program counter and input 
bus are routed through the arithmetic unit imder control of the Micro- 
Control Unit. The Arithmetic Unit has been built and is fully operational. 

The Micro- Control Unit of the SAMSON uniprocessor is 
the heart of all the processor control and timing signals. The major 
element of this unit is a micro- control memory implemented with 
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LSI programable read only memories (PROM,'s), The content of 
this memory specifies which of the many machine operations is to 
be performed during every phase of every instruction. The Micro- 
Control Unit has been built and is fully operational. It is anticipated 
that the instruction set presently microprogrammed will be altered 
as this research effort progresses. 

The Memory Interface Unit consists of the control and 
interface logic required to interface the uniprocessor with a commercial 
core memory. This interface is built and is fully operational; it 
interfaces with a 4K core memory. 

The status of the tmiprocessor is displayed by means of 
the SAMSON Control Panel which provides display, control and loading 

capability; thus enabling the operator to control and observe the 

✓ 

operation of the various uniprocessors. Presently the control panel 
interfaces with only one uniprocess. As additional uniprocessors are 
added, the control panel will expand to accommodate the additional 
uniprocessors. The Control Panel has been built and is operational. 

A commercial paper tape reader has been interfaced via 
the Control Panel and paper tape reader interface. This interface 
is fully operational. In addition, a teletype interface has been built; 
however the teletype has not been received yet, 

A Micro- Memory Monitor has been built and is fully 
operational which allows monitoring the sequence of micro-memory 
executions. 

Present efforts are being directed toward the development 
of data collection software to determine the desireability of specific 
features of the SAMSON architecture. 
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Compilation and Scheduling of FORTRAN 
Programs for Parallel Processing Systems 

le Introduction 

Two phases of preprocessing (i.e, compilation and sched- 
uling) of Fortran programs are discussed. The purpose is to 
make the program more efficiently executable on .a parallel 
processing system by exploiting as many parallel-processable 
computations as possible. It is hoped that this will increase 
the throughput as well as system hardware utilization. 

Since only the speedup due to concurrent arithmetic 
operations is concerned, it is necessary to examine the v^-hole 
program and sort out those statements which involve direct 
arithmetic Computations. Namely, we shall deal only with the 
following three types: 

1. Assignment Statements 

2 . IF Statements involving arithmetic operations 

3. DO Statements. 

II, Compilation and Scheduling of assignment Statements. 

We call a Block of Assignment Statements {BAS) as a group of 
Statements consisting of only assignment Statements, They are 
not necessarily continuous Statements in tne original program, 
but should preserve the order that they appear in the program. 
The order is important because a variable may be updated before 

t 

or after it is used^ 
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!• Parsing of statements. 

A conunon way to show the syntacti*^ structure of an arith- 
metic expression for the purpose of compilation is to use a 
syntactic tree where each node represents a binary operation 
on nodes or operands of previous levels. Many different tech- 
niques have been developed to construct a systactic tree [1] , 

[2l among which , Baer and Bovet's technique [3J using; operator 
precedence order is in general considered the best to produce 
a tree with minimum height. Other algorithms using distribution 
law as developed by Muraoka, Kraska/ .and others [4] > [5], [6], 

£ 7 ] can sometimes be used to further reduce the tree-height. 

But the complexity of their algorithms prohibits a practical 
application. Therefore, only operator precedence orders will 
be considered, and they are, in decreasing order: 

(x, — ) , (+, -) , parentheses { ), 

We first define a number of terms: 

Def.: A tree is a set of nodes and arcs (edges) in a leveled 

structure such that each node at level j has two or more 
inward arcs emanating from nodes at level i, where 

, and each node at level j has only one outgoing arc 

« 

to a certain node at level k, where k>^ j+] . Nodes at level 
0 have no inward arCs, and are called leaves or initial 
nodes. 

There is only one node at the bottom level of the tree. 

It has no outgoing arcs, and is called the root of the tree, 
or terminal node. 


For sake of brevety, unless otherwise specified, a node 
will generally refer to one of those between the initial 
and terminal levels. ' 

Def . t If a node N£ has a chain of arcs leading to another node 

Nj, Nj is then called a successor of and a predece- 
ssor of Nj, denoted by Ni<Nj. If there is only one arc 
between N^ and Nj , they are called immediate predecesor 
and immediate successor to each other . 

Def,: A binary tree is a tree Xirhere each nqde has exactly 
two inward arcs. 

Def, ; A syntactic tree is a binary tree representing an arithmetic 
statement, where each node represents a binary operation 
and its result on the data carried from the two immediate 
predecessors by the inward arcs. Sometimes the terms "nodes" 
and "operands" (which the nodes represent) are used inter- 
changeably. - 

It is apparent from above definitions that a tree can not 

form a loop among its nodes. .. 

The following is an algorithm for generating a syi^tactic 

tree of an assignment statement. 

1) , Place all the right-hand-side (RHS). variables and their 

associated operators at level 0 . 

2) . Scan through the RHS of the Statement from left to right. 

If tvio operands can be joined by the operator betx'^een them 
without violating the operator precedence order, a nev7 node 



representing each one operation is created and placed at 
level 1. 

li" an operand can be joined with more than one operand, the 
one scanned first has higher priority. ; 

3) . Scan through the nodes in level 1 and those not yet combined 

in level 0 from left to right. Again combine node pairs where 
. operator precedence orders permit, and place them as new nodes 
in level 2, 

4) . Proceed in the same way to combine nodes in level 2 and so on 

until all operands of the RHS are combined and a single node 
results at the root xvhich is the desired left-hand-side (LHS) 
variable of the assignment statement. The syntactic tree 
is now completed. 

An example of parsing is shown in the following. 
X=A+B+C+D*E*F+G+H 

Level 0 

1 

2 

3 

4 

it is obvious that a syntactic tree of n initial nodes must 
haVe at least riog 2 n~| levels, where fx'] denotes the least 
integer Such rhat 
2. The BAS graph 

We now extend the strueture of a syntactic tree to include 
all statements in a BAS. Again a few terras are first defined. 
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Def . ! Let Si be the i^h statement in a BAS, then I (Si) is the 
■ set of input variables contained iri the RHS of Si, and 
0(Si) is the output variable of Si, i.e. the LHS of Si» 

We observe the followings 
If i) 0(Si)6l(Sj), or 

ii) 0(Sj)€l(Si), where i<j, 

then Sj can be completed only after Si is executed. ; 

Otherwise Si and Sj can be executed simultaneously. 

In above, condition i) indicates that Sj depends directly on 
the outcome of Si, and condition ii) indicates that I (Si) will 
be updated by Sj at a later time, hence Si must be executed 
before Sj. ' 

The' statements in a BAS are parsed in the following way. 

1) . Construct the syntactic tree of the first statement accord- 

ing to the procedures described in Sec. 1. 

2) . Go to the next statement. If I(S 2 )ni(Sj^)=G, and 

I (S2)f^O(Si)=0, construct the syntactic tree for S 2 as before, 
with all its initial nodes at level 0, 

Otherwise * S^contains one or more input variable or subexpression 
(part of the arithmetic expression) which has been encountered 
in S^ and represented by some nodes (initial, terminal, or in 
between) in the first tree. Hence, in cGnstructing tree of S 2 , 
those operands are taken from appropriate nodes of the tree of 
Sj. For any operation that combines and N j , where i and j 
indicate the levels of the nodes, and j^i, the resulting new node 
is placed in level j+1. 
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3) . Go to the next statement and repeat the same process except 
that the input variables are now checked against nodes of 
all previously generated trees to determine if any dependency 
exists. The process continues until all statements are 
parsed. 

We now have a "tree complex'^ of the BAS, Since some nodes may 
now have more than one outgoing arc and the whole tree complex 
can have several terminal nodes, we shall more properly call it 
a BAS graph. 

Def.s A BAS graph is a directed, acyclic graph consisted of 

interconnected syntactic trees , It shows all the intra~ 
statement and inter- statement operations. 

A few points are to be noted: 

1) . All nodes at same level in a BAS graph are operationally 

independent and hence can be executed simultaneously. 

2) . If a node has more than one outgoing arc, its information 

either has to be shared by more than one PE at same time 
(if immediate successors are at same level), or must be 
stored for later use (immediate successors at different 
levels) o 

3) , Since the parsing of statements is done seguentially in 

. order, it may not produce a tree whose intermediate nodes 
can be direetly used for subsequent statements, although 
such a strueturally different but functionally equivalent 
tree is possible and more desired. For example, for a BAS 
which contains a statement used in the last example, namely, 
X=A-I-B+C+D*EaF+G+H and another statement Y~B+C+E*P*K, if RHS 
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of first statement- is parsed as follows {shown partially) 
instead of what was Shown previously. 
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then nodes a and b are readily usable by the second statement 
while make no difference to the first one. 

This shows that to make even more use of the inter-- 
statement parallelism, other parsing algorithms are possible. 

But its value being offset by the added complexity in searching 
for identical suboperations among different statements is 
questionable for practical purpose, except in some simple 
programs where parsing is done by inspection. 

3. Principle of Scheduling 

To facilitate later discussions, we define the following 

terms, 

©ef , j A graph G is called a relaxed graph, denoted by Gj^, if all 
the -terminal nodes are placed in the bottom level (which 
has the largest level number in Gj^, assuming to be q) , and 
all other nodes are placed in a level such that the distances 
(i.e. the difference between level numbers) between each 
and its immediate successors are minimized (the minimum 
being 1) . 
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Def . s A node weight Wj^ is the number of time units required to 
perform the operation of node 

Def » s The largest backward path value of a node is defined 
by W^=Wi+ max [Wj j n j>n^ 

It is the largest sum of node-weights between n^ and its 
terminal nodes over all possible paths « 

Def. % The largest for\i?ard path value D^ of a node n^^ is defined 
by Di==Wi+max jp^ | nh<n^ . 

It is the largest sum of node-weights between ni and its 
initial nodes, 

Def. : The critical path value Gg of having g levels is defined 
as Cq=max |wj_j ni€Gj^ . 

The chain of arcs resulting in Cq is the critical path, 

Def, ; Let i be the level number in a Gr, l^i^q^ and there be m 

nodes contained in Gr from level 1 to level i. The partial 

node-weight sum. Pi, is defined as 

m ' • 

Pi=»twj. 

The scheduling of nodal operations is based on a relaxed graph 
with added information defined above. The resulting graph is 
commonly called a job graph. We describe the generation of a job 
graph from a BAS graph next. 

1) , Place all terminal nodes of G in the bottom level q of the 

graph. 

2) . All nodes in G. whose successors are only found in level q are 

placed in level g-1. 

3) . Repeat this process .upv;ard in decreasing level order so that 

all nodes which have successors only in levels greater than i 


are placed in level i of Gj^, ^i'he process terminates only when 
all nodes are adjusted. All arcs in Gr retain their relations 
with nodes as they have in G. 

4) Include node weights wi for all nodes n^^G|j, 

5) Compute the largest backward path value for all n^GGj^, This 
is easily done starting with nodes in level g and going upward. 
In general, Wj^ is found by adding Wj|^ to the largest of Wj*s 
v/here n . *s are the immediate successors of n . . The largest Wj 
in the graph is the critical path value, which also determines 
the critical path, 

6) Compute the largest forward path value for all nodes. 

. This is done Starting with nodes in level 1 and going downward. 
Dj is found by adding Wj to the largest of D^'s where n^'s 
are the immediate predecessors of n j . 

The following theorem by Kraska [5] determines a lower bound on 
the number of PE’s required to execute the job represented by 
Gg in critical time Cg. 

Theorem ; If max jp^^/D^ j i<c^>m-l, at least m PE's are required to 
process all nodes of Gj^ in C time. 

* 

Using a job graph, there exist at least two typical 
optimal seheduling techniques using either forward tracing 
or backward tracing [5] to schedule the operations of nodes on 
a fixed number of PE's so that the total execution time is 
minimized. However, both techniques are too complicated and time- 
ccnsuming for real-time applications. It has been tested and shown 


that a rather simple heuristic approach can in many cases be as 
good* as an optimal scheduling. The principle iss 
i) . Schedule first those nodes which are in the next higher 
level and/or with highest backward path values, 
ii) , Keep as many PE's busy as possible by assigning nodes which 
are proces sable next. 

4, Scheduling and Execution with Operational Stacks 

'We now apply the principle of heuristic scheduling to 
store and transfer information from a job graph to a number of 
stacks so that they can be directly executed. These staclcs then 
define the order and concurrence of operations and are called 
operational stacks. The number of stacks is set equal to the 
smaller of the number of PE's available in the system or the 
number determined from Kraska's theorem cited above. In this way, 
one PE will be executing instructions from one stack and no 
redundent PE's are used. 

Using the job graph, the stacks are loaded in the following 

way. 

1) , Starting from all initial nodes of Gj^, ( Nj , load operation 

instruetionS defined by each node on top of stacks , Two 
situations may Gceur: 

• i) . If (stacks) , some top stacks may be left empty, 

ii) b Otherwise, load stacks first from those initial nodes . 
whick have highest ' s . - 

in either case, denote -^hose loaded nodes in this step by 

2) , Examine the imn^ediate successors of { nj and those left 

in (n) (i.e. Gonsider those which are parallel 



processable (i.e. none are predecessors or -Jccessors of 
each other) as new { n} and load the nodes which have highest 
Wj^'s in the next level of stacks until either condition similar 
to step 1 has occurred. (If a new stack has an empty top level 
then the top level is loaded first) , In general/ when a node 
n^ spawns to two immediate successors (or more) nj and nj^, nj 
is put (directly) under n^ in the same stack, and in another 
. stack with a predecessor indicator (i.e. (i)K ) and n^ has a 
successor indicator (i.e. i(K) ). Similar indicators are 
used when two nodes and nj merge into one node nj^, and 
one of the predecessors (say Uj) is not in the same stack as 

3). Repeat step 2) throughout until all nodes ate loaded. 

The execution phase of instructions is straight forward. 

Each stack supplies an instruction seguenco to one PE, which is 
executed serially. All PE’s normally run in parallel, independ- 
ent of one another except when a predecessor indicator occurs 
before an instruction. This sets up a flag and stops the PE from 
executing the instruction, unless its predecessors are executed 
and the successor indicators clearing the flag. 

An example of the parsing, operational stacks, and simu- 
lated execution of a BAS is shovm next. 

Example: Assume a BAS as follows: 

Data; Tl, T4, Bl, B2, B3, xl, x2, x3. 

T2=2* (Tl~T4)/3+T4 
• T3=(Tl-T4)/3+T4 
T1B=,5* (T1+T2) 

. ^T2B=.5* (T2+T3) 


T3B=,5* (T3+T4) 

Z1=A1*T1B+B1 

Z2=A2*T2B+B2 

23=A3*T3B+B3 

Q= (T1-T4) / (X1/Z1+X2/Z2+X3/Z3) . 
The job graph is as follows. 

i Pi/Dl 


0 O/O T1 T4 



11 63/31 
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Notes ; 1). Assume node weights for-f =5 

x=3 

+ , -=2 

2) .. Numbers outside nodes are largest backward path 

values W^, 

3) , Critical path value=36, critical path in heavy lines. 

4) . Max jp^/oj =Pjo/Djlo“ 2<> If hence 3 PE's is the lower 

bound for achieving 0^=36. 

Vi » 



III, Preprocessing of IF Statements 
Given an IP Statement of the type 

IP (s) ‘ 1,2,3 

where s is an arithmetic or logic eis]pression, and 1, 2, 3 
are branching instruction^ , the most obvious way to make use of 
parallelism is to parse s for maximum parallel suboperations. 

For example, 

IP (B*B-4*A*C) 6, 8, 9 

is converted to X=B*B-4*A*C 

IP (X) 6, 8, 9 

With added circuit 3ry if is possible to start processing 

all the branching instructions 6, 8, 9 simultaniously , v?ith the 
final result determined by the outcome of X, Hov^ever, since most 
IP statements have only simple expression as argument, the con- 
current operations of branching instructions on one IP statement 
alone does not cause considerable speedup. More sophiscated 
algorithms can be used to detect a block of statements with high 
Gccurrence rate of IF statements and converts the IF statements 
into a binary decision tree v/hich is then processed by a decision 
processor. More assignment statements associated with the IP 
statements are also collected and processed simultaneously. Details 
on this subject are discussed by Davis [9]. 

IV. Preprocessing of DO Statements. 

The principle in treating DO loops is to separate the loops 
into several completely independent DO loops and process them 


simultaneously. In order to do thiS/ precedence relations among 
the statements in the loop are examined. But, in addition to the 
ways encountered in the treatment of a BAS, since data can be used 
first and updated next and then again used in the next iteration, 
input variables have to compare with outputs of not only state- 
ments before it but also after it, 

1) . Compare I(Sj) with 0(8^), where i<j , If a match in variables 

with same index is found, then place an arc from node i to 
. node j. 

Also compare I(Sj) with O(Sj^), where k>j . If a match in 
variables is found whose index is higher in S^, there is an 
arc from node k to node j . 

2) . If some subgraphs are completely disconnected at the complet- 
• ion of the graph, each subgraph becomes a new DO loop, and 

they are all parallel proces sable. ■ ' 

Before looking into each new DO loop , we first define two terms . 
Def , : A DO statement is called recursive if it has the form 

X^:=f ( .•« , aj_) where m is any positive integer 

and a^^ some vector of parameters, Other;^ise the DO state- 
ment is non-recursive and Galled of array type, • 

We now continue the steps, 

3) . Within each new loop; 

i) . If statements are of array type, one statement can be • 

generated for each index value, resulting in many parallel 
statements. For example, ‘ 

DO 1. 1=1, 3,1 

1 A(I)=A(I+1)+G(I)*D(I) 


r 


A (2), and A (3) 


becomes A(1)=A(2>+G(1)*D(1> 

A(2).=A(3)+C(2)*D(2) 

A(3)=A(4)+G(3)*D(3) 

Three PE’s (or groups of PE’s) Can work on,A(l) 
separately and simultaneously. 

The assignment and scheduling then are same as in the case of a 
BAS, • ■ ; 

ii) o If statements are of recursive type, backv7ard substitution 
is used to generate, statements which allow simultaneous 
operations . For example , . 

DO 1 1=1,5 

1 A(I)=A(I-1)+B(I) 

A(1)=A(0)+B(1) 

A(2)=B(1)+B(2>+A(o) ■ 

o 

« , 
A(5)=A(0)+B(1)+B(2)+B{3)+B(4)+B(5) . 
which may be computed in only 3 addition times instead of 5, with 
3 participating PE's. 

It is also to be noted that in this case, A(l) through A (4) 
are actually included in A (5). Hence it suffices to parse only 
the RHS of A (5) and in the process appropriate intermediate results 
are obtained for A(l) through A (4), 
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An example of decomposing a DO loop is shown next, 
DO 6 1=1, 3, 1 . 

1 T(I)=G(I)+M 

2 G(I)==T(I)+D{I) 

3 E(I)=P(I“1)+B(I) 

4 P(I)=S(I)+G(I) 

5 H(I)=A{I“1)+H{I-1) 

6 A{I)=C(I)+N 



In the accompanying graph, we observe: 

•1 — > 2 because of T(l) in both. 

2 4 because of G(I) in both. 

3 4 because of E(I) in both. 

4 ^ 3 because of F(I) in 4 and F(l~l) in 3 . . 

€ — 5^ 5 because of A (I) in 6 and A(I-l) in 5 . 

And since there are 2 completely disconnected subgraphs, 2 new 
DO loops are created: 


DO 4 1=1, 3, 1 

1 a?(I)=G(I)+M 

2 • G(I)=T(I)+D(I) 

3 E(I)=F(I-1)+B(I) 
F (I)*E(I)+G(I) 


DO 6 1=1, 3, 1 

5 H(I)=A(I-1)+H(I-1) 

6 A(I)=C(I)+N 


4 


r 


t 
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We further decompose each new DO loop as follows; 


For 

first subgraph; 

• 


DO 1 1=1, 3, 1 


1 

T(I)=G(I)+M 

generate 


DO 2 1=1, 3, 1 


2 

G(I)=T(I)+D(I) 

generate 


DO 4 1=1, 3, 1 

. 

3 

E(I)=F(I“1)+B(I) 

generate 

4 

F(I)=E(I)+G(I) 

generate 


(comment) 

3 array equations T (1) ,T (2) ,T (3) 

3 array equations G(l) ,G{2) ,0(3) 

3 recursive equations E (1) ,E (2) ,E (3 ) 
3 recursive equations F { 1 ) , F { 2 ) , F ( 3 


By back- substitution, the recursive equations for E and F are 
E(1)=F(0)+B(1) 

E(2)=F (0)+B(l)-fB(2) 

E(3)==P(0)+B(1)+B(2)+B(3) * ; 

F(1)=F (0)+B(l)+G(l) 

F(2)=^F(0)+Ba)+G(i)+B(2)+G(2) 

F(3)=F (0)+B(l)+G(l)+B(2)+B(3) +G(2)+G(3) 

Parsing on F.(3) produces all the answers for above 6 equations: 



r’ 




• For the second loop, 3 array equations and 3 recursive 
equations are generated in similar ways for statements 6 and 5, 
respectively. We also observe that all three array equations of 
Sg can be executed simultaneously and before Sg. 

VI. Conclusions ~ 

Some compilation and scheduling aspects of computation~ 
oriented statements for parallel processing have been discussed. 
The treatments have been confined to rathe"r general cases, so 
that the results are applicable to most parallel processing 
systems. Although other theoretiGally superior algorithms or 
techniques are available, it is felt that their real-time 
appliGations are not so readily justifiable due to the amount of 
Software/hardware sophistication, except possibly in some rather 
special applications. 
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1.0 INTRODUCTION 

The purpose of this paper is to present a general overview 
of an ongoing research program in the area of highly parallel infor- 
mation processing system organization and the specific development 
and implementation of an advanced experimental computer organizational 
concept entitled, ''Simultaneous Multiproees&or Organization", abbrevi- 
ated "SANSON". This work is broad in scope, encompassing; 

p parallelism and parallel processors,* 

o completely Independent multiple processors or multiple 
Independent single instruction single data (SISD) 
stream processors, 

o concurrent processors or single instruction multiple 
data (SII'ID) stream processors, 

o task sharing computational processors or multiple 
instruction multiple data (MIND) stream processors, 

o loosely and strongly-connected multiprocessors or multiple 
instruction single data (MISD) stream processors, 

o raultiprogrammed multiprocessors, 

0 real-time multiprocessors, 

o memory organ! zat ion(s ) , 

o communications and data routing and, 

o path building and supervisory control. 

It is useful to classify the requirements and problems 
associated viith a multiprocessor computer system into two classes, 
as suggested by Saltzer^, intrinsic and technologicalo Intrinsic 


* This paper uses the terra "processor" rather than computer here, to 
distinguish the structure of the computational processing from the 
overall computer system which includes input/output, I/O, processing; 
peripherals; etc. • 

+ As distinguished from task sharing with separate processors for -I/O, 
computation, etc. 


requirements and problems are inherent In the problem Itself whereas 
technological requirements and problems are transitory In nature, 
being Inherent to the technology at a given point in time. The 
Important Intrinsic problems being studied in this research effort arei 

o providing an easy ability to access, transform and 

share Information within a multiprocessor in a controlled 
mannero 

o to provide protection of this Information, and 

o implement an experimental multiprocessor, SAIISON, with 
these features o 

In addition, the technological limitations which Impose 
bottlenecks within a multiprocessor being Investigated as a part 
of this work areJ 

o the memory contention problem and, 

o the Goramunleation problem o 

In addition, the SAMSON computer, Figure 1, must be a 
viable system providing a basis for, and a vehicle upon whichi research 
of advanced multiprocessor organlzatiGnal and architectural concepts 
can evolve; that Is, a research vehicle for multiprocessor computer 
system development, capable of supporting a v/lde range of investigations 
In computer structures and computer sclenceo SAMSON must be capable 
of possessing the necessary flexibility and computing capability to 
assure its applicability to a wide variety of present and future 
computational applications and to assure the system’s continuing 
evolution and viability o’ 

To meet these goals and to satisfy the widely varying 
nature of the computational requirements Imposed on a simultaneous 
raultlprocessor, the system demands a genei’al-purpose computer 
organization capable of a certain degree of parallelism. 

In this research work a strongly-connected multiprocessor 
capable of SISD, SIHD and MIMD stream processing (which are special 
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cases of MISD stream processing) as well as being capable of MISD 
stream processing will be developed and an experimental machine 
implemented, the SAMSON machine^ Particular attention will be given 
to memory organization and communications in order to eliminate the 
classleal memory contention and path building problems peculiar to 
multiprocessors. Furthermore, the requirements for multiprogramming 
and real-time multiprocessing will be Investigated and, if practical, 
one or both of these features, Incorporated into the SAMSON architec- 
ture. 

9 

1.1 MOTIVATION FOR PARALLELISM AND MULTIPROCESSING 

There are several reasons which exist as motivation for 
doing research in the area of multiprocessors. Three of the most ‘ 
obvious reasons being improved reliability, improved maintainability 
and ease of expandability. However, these motivations,, in and of 
themselves, are not adequate justification for pursuing this research, 
since duplex systems* (which are not here considered as multiprocessors) 
offer improved reliability and maintainability while providing an 
expansion capability through the use of the back-up processor. Thus, 
additional motivation must exist to justify pursuing this research-; 
this additional motivation is indeed the prime motivation for this 
research. 

This additional motivation results from the ever increasing 
complexity of present day computational requirements, coupled with 


* A duplex system usually consists of two identical processors 
operating so that, in the event of shut downs due to maintenance, 
checkout, improvements, etc., of one processor, the second can 
operate v;ithout a reduction in capability of the system. In 
addition, the back-up processor may be utilized to run lower 
priority tasks or as a slave to the primary processor (in which 
case a shut down of either will reduce system capability). 


the necessity of ever Increaslngs data processing rates, response 
time, and capacity* These requirements, in specific application 

have exceeded and, in general, are gradually exceeding the processing 

2 

capability of some of our present generation computer facilities* 

Purtherraore, recognition of the fact that generally much 
of a large computing task is repeated execution (on similar or 
dissimilar pieces of data) of the same procedure, leads one to the ‘ . 
observation that utilization of this inherent parallelism in a 
simultaneous multiprocessor organization might well provide a 
dramatic improvement in computing power* ^ 

Consequently, it is the ultimate objective of this 
research effort to develop a multiprocessor organization capable 
of a significant Increase in computing power through computational 
parallelism, with Increased flexibility, reliability and maintainability, 
without exceeding the existing limitations of the component technology. 


2.0 HISTORICAL BACKGROUND 

The concept of parallelism is very broad and Includes a 
wide variety of notions. Intuitively these notions includes many 
devices working together in some way or other on the same task, 
devices working together on different but related tasks, and one 
device performing more than one task at a time. This broad notion 
of parallelism is, in part, responsible for the \irlde diversity pos- 
sible within the area of parallel processing, l.e*, SISD, SIND, MISD 
and MIND stream processing* 

Prior to a discussion of the historical background of 
parallel processors, the concepts of global and local control need 
be introduced, - 

Global control implies the existence of one or more 
central contro?. units, either centrally or decentrally located, having 
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common control over a number of processing elements., Local control 
Implies control of a processing element by its own control unit 
(either locally or remotely located)* 

K 

The SOLOMON computer'^ is a classical machine concept 
consisting of an array of 32 x 32 processing elements, each con- 
nected to Its four nearest neighbors, under the global control of pne 
central control unit. These processing elements have limited pro- 
cessing power? the central control unit processes a single program 
with each participating processor executing the same instruction 
stream on its own data stream, SIMD stream processing. Depending 
upon the internal state and mode of each of these processing elements, 
they either participate or do not participate during an instruction 
.execution. The internal state of the processing element is a 
function of the data it is processing while the mode of the processing 
element is determined by the control unit. The principle means of 
control for each processing element is the mode commands. The major 
disadvantage of this machine organization and of global control 
structures in general, is its low efficiency when applied to typical 
general-purpose computations.^ These computations require sequential 
execution? consequently, the central control unit utilizes only one 
(or relatively few) processing elements while all others stand idle. 

The ILLIAC IV system'^* alleviates the problem of the 
SOLOMON machine to some degree bys replacing the single global 
eontrol vinit v?ith four such units, by providing I/O access to each 
processing element, and Increasing the power of the processing 
elements. Each of the four global control units directly governs the 
operation of an 8 x 8 array of processing units. These global control 
units may function Independently (64 processors per array) or in 
groups of two, three, or four arrays. These processing elements .are 
ail but devoid of local control, mode status and data dependent 
conditions being the only exceptions. 


4 
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Q 

The HOLLAND machine^ Is a distributed control machine con- 
sisting of an array of modules* each of which possesses some measure 
of local control and a limited capability for independent Instruction 
exeeutiono Within this structure the capability exists (to a limited 
extent) to execute Independent programs simultaneously o HOLLAND pro- 
posed the organization to provide a basis for theoretical investiga- 
tions* not as a practical deviceo Each module contains a storage 
register, operand registers and communication paths to its four 
nearest neighbors- An instruction stream is stored one instruction 
per module with indicators to denote the predecessor and successor 
modules (one of the four neighbors)- The instructions are executed 
in time sequence following the above predetermined path- The 
address of the operand is also stored in the module- The storage 
register can be loaded from an external source during the first 
machine cycle. During the second cycle the active module determines 
the operand location and establishes communications between the 
operand and the accumulation module- In the final phase the instruc- 
tion is executed. The proposed attribute of local control structures 
is the high degree of hardware utilization that can be achieved when 
many (small) computations are being executed concurrently; however, 
the HOLLAND machine architecture has fallen far short of its goal. 

The main problem associated with the local control organization of 
the HOLLAND machine is the spaclal structure, of both the array and 
of the neighborhood communications, which results in serious path 
building problems - 

III addition to the above mentioned array multiprocessors 
there have been some noteworthy multiple computer systems which have 
been developed. These include the PILOT, LASc .and STAH Systems- 

10 

The PILOT system , consists of three Independent proces- 
sors- The basic computer functions are divided, among these three 
different processors, upon the basis of the processing task. Each 
processor is optimized to perform one of the following basic functions*. 
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arithmetic and logical operations# control and housekeeping operations, 
or input-output operations® The processors are independently programmed 
to perform their own tasks® The PILOT is not here considered a multi- 
processor® 

13 . 

The LARC system is a dual computing System consisting of 
two main computers, a totally time shared memory bus (with a maximum 
of 39 Independent memories), and' an input-output processor. The 
main feature of the LASC system is its memory organization. The 
memory system consists of a single time multiplexed memory bus and 

o 

includes such features as overlapping (which makes possible the pro- 
cessing of several instructions concurrently), priority determination 
and interlocking memory protection® The LARC is, here considered, a 
limited multiprocessor; the two main computers are capable of operating 
either independently or jointly to a limited degree® 

12 

The STAR computer is a replacement system utilizing a 
redundant machine organization having one or more { unpowered) spares, 
which are switched on-line to replace units® To .detect and replace 
these failures, the -system utilizes s error detecting codes, software 
diagnostics, software recovery procedures, transient fault identifi- 
cation (by repetition of program segments), and redundant special 
purpose processors to monitor, vote and replace subsystems® The 
STAR is not here considered a multiprocessor® 

Other than the ILLIAC IV none of the above are contem- 
porary processors® The only major contemporary process'or under 
construction is the C®ramp (Carnegie Mellon multl-minl-proeessor) 

The C»mmp is in its early stages and little can be said in the way 
of operating features, performance, etc. at this time® 

ILLIAC IV is still in the construction and debugging phase; only 
one of the four quadrants will be constructed® 


e 



3.0 DEFINITIONS 


The concepts of parallelism and of multiprocessors are 
both very broad in scope. It Is the Intent of this section to 
attempt to define, or at least to place some bounds, on these 
concepts. 

3.1 DEFINITION OP PABALLELISM 

Previously the intuitive notions associated with the terra 
parallelism were Introduced. The concept of parallelism can best 
be formally defined by dividing it into two types, applied and 
natural parallelism. Applied parallelism is the property that 
enables two or more identical operations within a set of computations 
to be processed concurrently on the same or distinct data bases. 
Natural parallelism is the property that enables two or more opera- 
tions within a set of computations to be processed concurrently and 
possibly Independently on the same or distinct data bases. 

Prom these definitions it is clear that applied parallelism 
is Just a special case of natural parallelism. However, the dis- 
tinction is made because of the important impact it has on com- 
puter organizations. Applied parallelism is efficiently handled by 
global control techniques since it provides common control for 
identical operations. Natural parallelism is efficiently handled by 
local control techniques since it provides independent process ing for 
different operations. Pigure 2 Illustrates how the expression 

A/X + B/X + CY = Z 

is computed with and without the use of applied and natural parallel- 
ism. 

The SOLOMON machine discussed above is based upon the 
principles of applied parallelism while the Holland machine is based 
upon the principles of natural parallelism. 


FIGURL 2 Computation of the Expression A/X + B/X + CY ~ Z 

in (a) sequential steps, (b) utiliv'.inp applied parallelism 
(c) utilizing natural parallelism, and (d) utilizing 
applied and natux'al pai'allelism . 
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3,2 DEFINITION OP MULTIPROCESSOR 

Much controversy exists In regard to the definitions of 
multiple computer systems. Frequently, the terms multiple computer, 
multiple processors, parallel processors, multicomputer and multi- 
processor are used interchangeably! in this work a distinction is made 
and their broad definitions presented, 

. MULTIPLE COMPUTERS, MULTIPLE PROCESSORS or PARALLEL 
PROCESSORS as defined in this work are general terras which are used 
interchangeably and apply to any system containing two or more, not 
necessarily identical digital computers, ThSse digital computers 
may be elementary processing units or full scale digital computers; 
they may be conventional or 'oneonventionai , general purpose or 
special purpose, and they may operate jointly or Independently of . 
one another, 

MULTI COMPUTERS as defined here is a group of two or more, 
not necessarily identical, digital computers interconnected via 
their input-output system (i.e, obtaining intercomputer communica- 
tion by considering other computer(s) as peripheral devices). These 
digital computers may be elementary processing units or full scale 
digital computers; they may be conventional or unconventional, and 
they may be general purpose or special purpose, 

MULTIPROCESSORS as defined here is a group of two or 
more, not necessarily identical, digital computers interconnected on 
an integrated basis (to effect intercomputer communiGations) capable 
of operating concurrently bn the same or similar phases of a pro- 
cessing task. Similarly, these digital computers may be elementary 
processors or full scale digital computers, conventional or uncon- 
ventional, and/or general purpose or special purpose. (The re- 
quirement of operating on similar phases of the processing task 
simultaneously eliminates the class of multi computers having inde- 
pendent processbrs for arithmetic functions, control funetions, ^d 


I/O functions from being here considered as multiprocessors .) 

16 

Bauer' s'^^ definition of a multiprocessor (which he referred 
to as a multi-computer system) required that It satisfy the following 
four criterias 

”lo There are two or more separate arithmetic control 
units capable of operating simultaneously and with 
special hardware facility for operating two or more 
programs simultaneously o 

2o There are two or more indepently operating primary 
random access memories.. 

m 

3o Communications among major elements of the system Is 
of the memory-to-memory type at memory access speeds. 

4. All major components of the system are In use during 
normal operations.'*' 

In this work Bauer's criteria for defining a multiprocessor 
are amended as follows? Criterion 2, Is intended to imply that two 
or more quantities can be accessed simultaneously; however, Bauer 
used this criterion to eliminate from contention systems having one 
large random access memory. Systems having one such large random 
access memory, capable of being accessed by more than one processor 
simultaneously (l.e. by Interleaving), as well as, systems having 
multiport memories should not be, and are not here, elimated from 
being classified multiprocessors. Therefore, criterion 2 should 
read? Two or more independent quantities can be accessed simulta- 
neously. • . 

Criterion 3 does not apply to systems which provide for 
access to a common memory. Furthermore, register to register type 
communications should be respected as a possible means of Intermachlne 
communications. In fact, since minimum coraraunlcatlon time is fre- 
quently a major multiprocessor goal, register to register coraraunica- 
tlons is an extremely desirable feature and Indeed should be a sought 
after objective in a multiprocessor architecture. 
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The frequency of use of all or most of the major components 
of the system should he sufficiently high to justify a multiprocessor. 
(Unfortunately, whether or not criterion 4 is satisfied* assuming a 
reasonable architecture, is largely a function of the programmer’s 
ability and ingenuity, his familiarity with the system, and the 
nature of the programs for which the system is utlllzedo) Alternatively, 
the extent to which a multiprocessor utilizes its hardware efficiently 
could be ‘considered secondary to the speed advantage hoped to be 
gained by a multiprocessor organization. 

4.0 SAMSON GONPIGURATION 

Throughout the development of such an experimental 
multiprocessor, evaluations of various design alternatives and a 
determination of the effects of various parameters on total system 
performance will be studied. Through this effort we will be advancing 
our knowledge of the type of architectural and programming techniques 
required of future generation parallel processors to achieve increased 
effectiveness, performance and computational capabilities.* 

No one machine organization can handle widely divergent 
tasks with equal capability. However, the multiprocessor should be 
capable of handling a broad range of tasks more efficiently, at 
least time wise, than a uniprocessor. These tasks should not be so 
broad that the efficiency of any particular task be seriously 
Impaired. In the worst case a multiprocessor should be no worse 


* No specific programming development is Intended as a part of this 
researcht however, insight will be gained regarding programming 
techniques during architectural development. 
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than a general purpose uniprocessor.. 

Hardware utilization efficiency must be secondary to 
computing power if system versatility is sought. If the tasks to 
be handled are highly restricted and well defined, in advance, then 
special purpose hardware can be utilized to optimize the processing 
tasks. However, use of such special purpose machines, on tasks not 
originally included in the limited set of tasks considered, would 
be highly restrictive and inefficient. 

Consequently, if versatility is a goal, a general-purpose 
architecture is called for and the related -inefficiencies associated 
with particular tasks must be accepted. A general purpose archi- 
tecture tends to Increase total execution time; this is equally 
true of uniprocessors and multiprocessors. 

4.1 OPES ATI OH AL PEATCSES, 

The operational features of the SAMSON machine must 
Include the facility for data manipulation arid computation as well 
as the capability of controlling the sequencing of instructions. 

SAMSON will include all of the conventional facilities 
for (local) instruction execution and it will provide global 
instruction features. In addition, SAMSON will possess a high 
degree of concurrency in that simultaneous operation of several 
processors jointly on one task or separately on different tasks 
will be possible, . 

The basic processor (central processor) elements of the 
SAMSON computer are l6 bit general purpose raleroprogrammed processors 
having multiple general purpose registers operating on 16 bit data 
words. Their instruction repertoire will Include special multiprocessor 
instructions (global instructions ) , such as PORK, JOIN, etc. 


The major characteristics of the SAMSON processor 
elements ares 

o Type 

— general purpose* digital, binary 
-«-full par al Tel organization 
— rai cr opr ogr ammable 

o Arithmetic 

—binary, fixed point 
—16 bit data word 

— negative numbers in 2*s complement form ^ 

--add times 1 sec (register»to register) 

2 P sec (memory to register) 

— multiply times 21 P sec (Average, register to register) 
— divide time: 27«5 ^ sec (Average, register to register) 

G Memory. 

— DRO core 

--1 /'■ sec cycle time 

— each memory is expandable to J2K words in 4K or 8K 
increments 

o Addressing Modes 

--direct addressing to 512 words of memory 
—Indexed addressing (execution time unaffected) 
—multilevel Indirect addressing 

o i npu t /Ou t pu t 

— program interrupts expandable 
— power fail and power-restore interrupts can be 
provided 

— I/O communicates with CPU and memory by means of 
separate common data and address bus; I/O devices 
can be added as reciuired 

—standard peripherals - teletype and/or CRT display 
and paper tape reader ' , 


Execution time based on a memory cycle time of 1 p sec» 
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o Logic 

—implemented with LSI and MSI arrays of T^L 
4,2 MEMORY ORGANIZATION 

Multimemory aGcessabllity is required at the processor 
level in a multiprocessor since, in general, it is impossible to 
operate solely out of one memory unless swapping procedures, anti- 
clpative procedures, or mapping techniques are utilized. Although 
these features are desirable in a uniprocessor memory hierarchy to 
obtain an economical configuration, their usefulness as the primary 
memory for a multiprocessor is questionable. Nevertheless, these 
techniques could be utilized to supplement the primary memory. 
SAMSON’S present memory organization will not include these features; 
however, provisions can be provided for their addition at a later 
date,' should the research in this area so Indicate. 

A modular memory organization consisting of several 
small singleport memories, rather than one large common multiaccess 
or multiport memory, will be utilized in the present SAMSON 
architecture. It should be noted that this was one of Bauer’s 
criteria for a multiprocessor and although exception was taken pre- 
viously, the exception was based upon this criteria being a necessary 
condition (and not on it being a desirable condition). Each memory 
module will be accessible by all processing elements. This scheme 
provides for parallel transfer of individual words or entire memory 
modules using present day switching technology; thus providing the 
capability of transferring large quantities of information at full 
memory rates. This multi-memory organization allows programs and 
data to be shared by several processing elements on a high speed 
basis while providing a means of insuring privacy of data. When 
Independently operating, the processors which comprise the SAMSON 
computer system will be able to communicate via memory to memory 
t'ransfers under program control. Memory transfers will be on a 
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request basis, thus avoiding catastrophic memory destruction. Careful 
consideration must be given with regard to providing a means of Insuring 
privacy and protection of data without unduly limiting the capability 
of sharing. 

It is Intended that at any given time each processor c^ 
have access to several memories, but only those assigned to it. 
Furthermore, any processor (if assigned) can have accessibility to 
all memory modules (memory stacking) which would be in excess of its 
Inherent addressing capability; thus providing an expanded high 
speed memory capacity. 

In a multiprocessor, it may well be desirable to be able 
to write identical, duplicate copies of data and/or Instructions 
into several memories without any additional expenditure of time. 

This feature is presently being considered for the SAMSON machine. 
Research in this area is required. 

The above memory organization enables the total computing 
power of SAMSON to be applied to those tasks requiring the full 
computing power of the machine. 

4,3 COMMUNICATIONS STRUCTURE 

The communication requirements of a multiprocessor can 
be divided into two classes of communications: Intermachlne 

communications and Input/output (I/O) communications. In either 
event the time ■Investment made in establishing the connection should 
be minimal, or . alternatively the connection should be retained f or 
a relatively long period of time in order to justify the time 
expenditure in establishing the communication link. Furthermore, 
the data rate should be sufficiently high to satisfy system 
operating requirements. 

Three basic communications techniques exist where by the 
major system components and peripherals can be interconnected, namely, 





space-division, time-division and frequency dlvlslon« 

The space-division techniques, Including completely 
dedicated communications as well as multilevel switching techniques 
(crosshar switching), are conceptually the simplest* However, an 
excessive amount of hardware is required and/or operational diffi- 
culties such as path building,, eonfllcting communications requirements, 
and extended periods of time required to establish connections may 
occur* 

Time-division or time multiplexing techniques are a 
viable alternative approach* Here queuing and priority schemes 
are utilized to Insure that one and only one device pair can have 
access to the data bus during any one time slot* With this 
technique time slots. can be either preassigned or a dynamic alloca- 
tion procedure utilized* The drawback of this technique is the 
possibility of one device pair (with proper priority) completely 
tying up the communication link* 

A frequency-division (frequency-multiplexing) seheme 
appears to offer the greatest flexibility and potential* The major 
drawbacks to this approach ares Garrler frequencies in the range 
of 100' s of megahertz are required (to meet the processors data 
rate ' requirements) and furthermore, the necessary hardware for 
modulators and demodulators (which are not standard computer 
components) needed for transmitting large amounts of information 
would he excessive* , 

4*3*1 INTEBMACHINE COMMUNICATIONS 

SAMSON’S Interraaehine communications, as presently viewed, 
will be a Gomblnation of both time multiplexing and spacial communi- 
cation techniques* It is felt that this approach is the most amenable 
to present state of the art computer techniques. 



Memory communications will use both of these techniques. 

In that each memory will be assigned a ‘ dedicated • coramunl cat! on path 
to its associated processor as well as a connection to the common 
intermachine memory communication, network. 

Intermachine register to register communications is also 
being considered for SAMSON, However, no decision has been made 
whether such communications will be included in the present SAMSON 
architecture, 

4.3,2 IKPUT/OUa-PUT FACILITIES 

«> 

The purpose of an input-output (I/O) system Is to 
efficiently, reliably, and In an orderly fashion transfer the 
maximum amount of Information in a minimal amount of time and with 
minimal interference to other subsystems. The I/O philosophy should 
provide a general purpose Interface with all the necessary control 
and communications paths to allow data transfers betvfeen the various 
peripheral devlcss and the main processing system, 

I/O communication requirements differ from intermachine 
communication requirements since the peripheral devices have’ widely 
varying characteristics, because of the multiplicity of the peripheral 
devices, and the unrestricted physical length over which the 
communication may take place. An additional fundamental difference 
in the type of communications results from the assignment of 
peripherals. Whereas memories are assigned to computers under program 
control (either by request or otherwise) I/O peripherals belong to a 
common pool of devices which, although available to all, are only 
"loaned" to a particular processor. The peripheral remains on loan 
to the assigned processor until the request is fulfilled or in some 
Instances until an interrupt occurs from a higher priority source, 

•Reasoning as above, SAMSON »s I/O communication philosophy 
will be based upon a time multiplexed system. (Fixed time slots will 
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not be utilized here due to the pseudo random nature of the required 
Interconnections). Furthermore, the I/O system will be a request- 
response system, thus allowing maximum transmission rates over 
various physically different length paths. 

In addition, the SAMSON I/O facility includes both 
internally and externally controlled data transfers. 


. 5.0 RESEARCH STATUS 

It Is the objective of this X'orork to physically implement 
the SAMSON machine, as described herein, with appropriate modifications 
reflecting the results of the ensuing research. 

The SAi^SON machine will be a useful tool for performing 
precise mathematical operations across a broad spectrum of appli- 
cations while providing a multiprocessor both suitable for, and 
with the necessary capabilities to enable research In such areas 
ass 

o Multiprocessor architecture (few multiprocessors 
have been constructed; thus each represents a major 
research milestone)^® 

o The processor-memory interconnection 

o The memory contention problems 

o The communication mid control of a multiprocessor 
memory 

o The communication structure of multiprocessors 
G The instruction repertoire mandated by a multiprocessor 
o Applied and natural parallelism 
o Global and local control 
o The hardware operating system 

o The effect of a multiple aecumulator and/or raultipie 
Index register structure 


( 
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o Interrupt handling, DMA operatl^in, cycle stealing and 
priority assignments 

0 The effect of a hierarchical memory organization on a 
multiprocessor 

o The effects of overlapping, prefetching Instructions 
and operands, multiple instruction decoding and 
conditional branching 

o Data flow efficiency and input/output flexibility 

• o Diagnostic procedures, reconfiguration and degraded 
operation 

o The optimum sequencing problem 

G Software application versatility, decomposition of 
computations and overall executive efficiency 

It is beyond the scope of any one research effort to 
study all or even most of the above mentioned subject matter. 

Some of these topics will- be covered as part of this research in 
the development of the SAMSON machine; i.e«, multiprocessor archi- 
tecture, processor-memory interconnection, memory-contention, 
memory communication and control, communications, instruction 
repertoire, parallelism, and global and local control. It is felt 
that the subjects covered' by this work, the additional subject 
matters listed above, as well as any additional topics uncovered 
as a result of this and other research efforts will be continued 
in future research activity. It is felt that SAMSOK will be an 
Invaluable aid to future research efforts in the area of multiprocessors 
since research in this area has a strong experimental and empirical 
component requiring research, design and construction of many 
systems. 

The SAMSON machine is in the Initial research stage of 
development and i^lll consist of four (4) uniprocessors capable of 
concurrent processing (as a strongly-connected multiprocessor). 


The first SAMSON processor element is presently being 
debugged. Similarly » the SAMSON test set is also in the debugging 
stage. One piece of support test equlpraent has been completed and 
is being used to assist debugging of the SAMSON processor element. 
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nr. Digital Computer SlrnulatiQn and Autoirjatie Fault Detectipn for Space 
Shuttle Electric Power Distributigii System 

by F. E. Thau and C. B, Park 


One aspect of our analysis of the Space Shuttle power distribution 
system has been Goncerned with the development of a flexible digital 
computer program, comprising a fixed main routine and a number of 
user-*- supplied subroutines, which simulates the dynainic performance 
of the distribution system given a specification of a sequence of mission 
modes and the load distribution in each of these modes. The program^ 
previously delivered to NASA, has been tested successfully on a network 
of about the size of one third of the proposed power distribution system. 

This program can be applied for the evaluation of competing proposed 
modifications in the electric distribution system, structure or in the ratings 
of eie nents of the distribution system., A description of the program is 
cGntained in section III. 1 , 

A second aspect of our analysis has been the development of tech- 
niques for automatic fault detection for applicatton in future space shuttle 
designs. Two approaches have been taken; a reduced, search procedure 
was developed to use a limited number of physical measurements together 
with a listing of nominal conditions to automatically specify which loads or 
cables are faulted. This approach is outlined in section III, 2. a. The second 
approach uses a dynamic model of the distribution system in the design of a 
sampled - data observer or filter whose inputs are the available measurements 
of voltage and current and whose outputs are the state of the power distribution 
system. This approach is outlined in section III. 2, b. 
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A number of problems remain to be studied in order to apply automatic 
fault detection for future space shuttle power distribution systems. First, 
the automatic fault detection schemes outlined above should be tested 
on networks of full size to determine the computational requirements 
that each technique would impose on the on-board computers. Second, 
automatic fault correction or reconfiguration techniques should be developed 
to fully implement an automatic fault detection and correction sub- system. 
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A configuration of the electric power distribution system of the space shuttle 
is assumed. A sequence of distribution modes of the power system associated with 
relay states is also given in real time . The operation of the power system follows 
the schedule specified by the mode sequence. Section I gives the analytic basis for 
the simulation program. Section n describes the program in detail and Section IH 
contains an application example . 



Assume that the electric power distribution of the shuttle spaceship (EPDS) 
is represented by a schematic diagram in Fig. 1. 
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In Figo 1 the non-linear function of I, fft) depends on the nature of the V-I character- 
istics of the given Fuel Cell (D.C. source). Assunie that all the elements of the 
power system can be approximated as linear, lumped, and are given as circuit 
elements for a given mode, i.e. , 

1) Dynamic of each load is described by a linear differential eq or by a circuit 
description 

2) For simplicity each relay is simulated by a Resistor R 

R = 0 .01 f2 when closed 
R s= 10® £1 when open 
with a delay time given. 

ii) Determination of System Differential Equation 

One can describe the dynamic of the given power system under the assumptions made by 
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output variables of the system 

auxiliary variables which connect some states, input and output 
variables 


A determination of the submatrices A^ 2 t A^^, A 22 , E, Gj, G 2 depends on how to choose 
the auxiliary variables and state variables as physical elements • 

From the standpoint of computer analysis of the complex circuit Ref. 1 the state 
variables are taken to be all the inductive currents and all the capacitive voltages. 
Moreover, let the auxiliary variables be all the node voltages and all the branch currents 
except for those node voltages and branch currents corresponding to state variables, i.e.. 



V. indicates the i-th node voltage (i = 1,2, . . . , NNode) 
Ibj indicates the i-th branch current (i = 1 ,2 , . . . , Ifar) 


Rewriting (1) in terms of physical variables Jb, g[ gives 
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Where the submatrices Fjg, F^s, are determined by applying KCL law at each 
node. Therefore, the NNode x NNode matrix F^i, NNode x3 matrix B1 are null and all the 
elements of the matrices Fj2> Fj^ are 1 if entering into the node, -1 if flowing out of the 
node, otherwise zero. And the submatrices F21, F22» F31, A22* B2, and E are deter- 
mined so that all the branch elements, resistors, inductances, capacitances and inde- 
pendent sources are defined, 
iii) Formulation of Normal Form of State Equation 

To formulate the normal form of the state equation from the system differential equa- 
tions ( 2 ) , one has to eliminate the auxiliary variables in eq(2) . For the elimination of 
the auxiliary variables a gauss-elimination method has been used, i.e. , from (2) one 
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Where the indicated inversion of the matrix is assumed to exist . 
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From (2), (3) one can obtain state equation 
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iv) Solution of Non-Linear Differential Equation 

The dynamic of the EPDS can be described by a set of non-linear di^erential eqs. 


q = A q + B u 
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where all the elements of A, B are given. 



A solution of the above non-linear differential equation becomes very much unrealistic 

when some of eigenvalues of A are very large. When Some relays are open that may give 

8 

this case since the relay is simulated by a resistor whose value is 0.01 if closed, 10 O 
if it is open. To integrate the system with some eigenvalues large, one has to take a very 
small integration step size which may take prohibitive large integration time . One may 
overcome such a difficulty by finding steady state solution of some state variables corres- 
ponding to very large eigenvalues as a problem of singular perturbation. 

V ■* 

® Computer Program of Simulation of EPPS 

Computer program of simulation of EPDS consists of four main steps, i.e., 

(1) Read data aeeording to the specification of a circuit's branches and their con- 
nection and formulate a set of system differential equations . Read data for 
mode sequence associated relay states. 

(2) Reduce the system differential equations to the state normal form. 

(3) Solve non-linear differential eq . 

(4) Check all the relay states according to a mode sequence and modify system 
differential equation. 

To see these steps refer to flow chart. 
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As shown in the accompanying flow chart, the SEPDIS program consists of six major 
subroutines, DATIN, RELAY, PLINS, FUEL, ALOUT, RINPG. 

SUBroutine DATIN (KECT, VALUI, NR, NL. NC, NV. NI, M, NCV, NCI. NEQN. 

NVAR. NOU, NN, NZ. NAUX, NK, Kll, NQ) 

DATIN reads all the circuit elements, resistors, inductors, voltage sources associated 
with two nodes and store them in KECT and in VALUI (corresponding numerical value of 
element), i.e.. 

Outputs: KECT, VALUI 

NR: number of resistive elements 

NL; 

NC; 

NV; 

NI; 

M; 

NCV; 

NCI; 

NAUX; 

NVAR; 

NEQN; ■ 

NOU; 

NINP: 

NN, NZ, NK, Kll, NO are wor Icing variables 

SUBroutine Relay (NRLAY, R. IRY, NR, T. TSEQ, NMQDE, UK, KECT, VALUI, VN) 

Inputs; NRLAY, R, IRY, NR, T, TSEQ, NMODE (READ), UK (Main), 

VN (see note) 

Outputs: KECT, VALUI 

Note 


Note that a diode can be modeled by a relay associated with two adjacent node voltages. 
However, in this subroutine simulation of diode has been neglected at the present program. 


SUBroutine PLINS (A, B, KECT, VALUI, NR, NL, NC. NV, NI. M. NCI. F21. NK. Kll. 
NCV, NEQN, NINp, NVAR. NOV, NN, NSVR, NO, NZ, NAU'X, .GI, NRPT) 

Inputs; KECT (DATIN). VALUI (DATIN), NR, NL, NC, NV, NI, M, NCI, 
NK, Kll, NCV, NEQN, NINP, NVAR, NOV, NN, NO, NZ, NAUX, 
(all from DATIN) , NPRIN (READ) 

Outputs; A, B, F21, Gl, NSVR ^ 

A; System matrix F21 = [ai, J Aj 2 j > Gj 

B; Input " (Seeeq.l) 

(see eq.5) ' 
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capacitance voltages 
capacitance currents 
auxiliary variables 
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number of outputs 
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NSVRj number of state variables 


SUBroutine Fuel (YIN, X. NDee, NVAR > 

Input: X (from read initially and from integration subroutine RINPC) 

NDeg (read), NV-AR (DATIN) 


Outputs YIN 




YIN = 

32 ' 


f(Il) 


32 


fdz) 


32 


fds) 

k 


(see fig. 1) 


where the non-linearality must be programmed in the subroutine fuel. 


SUBroutine ALout (F21, YIN, X, PN, VN, NSV, ITK, T, Numbr, NDeg, NLoad, NNode, 

NAUX. BRC, W, Gl, NINP) 

Input; F21 (PLINS), YIN (Fuel), X (RINPC), NSV (NSVR), UK, T, Numbr 

(Main), NDeg, NLoad, NNode, NAUX (Head), Gl (PLINS), NINP (DATIN) 

Output: VN, PN 

VN (® ) : all the node voltages, branch currents 

PN («)s instantaneous powers at all the loads. At the present time, PN (») 
has not been programmed 

SUBroutine RINPC (X, DX, T. Step, ISW, Rmax, Emax, NDeg, NLoad, IJK, YIN, A, B. 

NSV, Numbr)- 

Subroutine RINPC is followed by subroutine DERIV (DX, X, YIN, A, B, NDeg, NSV); 

In the S ubroutine Deriv Input-Output specification is as follows: 

Input: X (RINPC and initially read). YIN (Fuel), A, B (PLINS), NDeg (Read), 

NSV (=NVAR) 

Output; DX describing the given dynamic structm'e of the system, i.e. , 

DX = A X + B U 

The arguments of the subroutine RINPC is as follows: 

Input: DX (Deriv), Step (Read), Rmax (=TSEQ (NNode) , NDeg (Read), NLoad 

(Read), YIN (Fuel), A, B (PLINS), ISW, UK, Numbr (working variables) 

Output: X 

An integration routine in RINPC is based on the fourth order R-K method. 

Restrictions on the Present Program 

1. ’ The dynamic system is assumed to be at rest 

2 . Only circuits having 60 system variables (= number of auxiliary variables + number 
of state variables) . 10 state variables , 3 inputs , are allowed . 
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Input to Program 

The present program accepts circuits having up to 60 system variables (= number , 
of state variables + number of auxiliary variables), 10 state variables, 3 inputs, no 
restrictions on output variables . 

(1) Data-input to the Program 
Block 1 

i) The first card provides the number of relays, the number of modes, the number of 
loads, the number of nodes and should be punched in the format 

13, 13, 13, 13 

ii) The second card gives an initial integration step, initial time, and frequency of print 
out in the following format 

E15,5, BIS'.S, 15 

Block .2 

i) The first set of cards in Block 2 provides the mode sequence in real time and is punched 
in the format e 


ii) The second set of cards gives che locations of relays in terms of node label, i.e., 
IRY (I, J), J=l, 2 will identify the I-th relay by two nodes. The format is, 


213 

iii) The third set of data cards in Block 2 provides all the relay states, i.e. , R(I, J) 
indicates the i-th relay state between the j-th mode and the mode. The third set 

of data cards should be punched in the format 

4E15.5 


Block 3 

Block 3 defines all the branches of the circuit to be analyzed. For each branch a single 
card should be punched 

EL (I,J) = +A 


in the format 


lAl, IX, 13, IX, 13, 2X, E16.5 


The integer i and j are the initial and terminal node labels of the branch, respectively, 

and EL defines the kind of branch 

EL-R if branch element is resistor 
EL=C " " " capacitor 

. EL=L " " " " inductor 

ELs'V " " " '* Volt^e source 

EL=I ” " ” " current source 
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The number A, the numerical value of the branch element ELj the number A may be 
any real number if EL=V. The last four cards in Block 3 should punch in the first 
column for each card 0 

C Example 1. Example of Input Data Format 

Consider a following two loads problem with one input as a test problem. 



Assume that the system is rest initially . Analyze the transient response of the system 
from 0 to 1 second when the relay between the node 1 and 2 is open at time 0.5 sec. 

f(I) « Exp. ( -.0, 01^1 ) 


Input Data 


Block 1 


i) The first card 3 3 2 6 (in the format 413) 

ii) The second card 0.01 0. 10 (in the format 2E15.5,I5) 
Block 2 

i) The first card 0 . 0.5 1. (4E15.S) 

ii) The second card 1 2 (213) 

The third card 2 3 (213) 


ill) The fourth card 111 

The fifth card 111 
The sixth card 10 0 


(4E15.5) 

(4E15.5) 
(4E16. 5) 



Block 3 


R I 2 1. E-2 

R 2 3 1. E-2 

R a 4 I. E-Z 

R 3 4^2.2 E O 

R 46 1. EO 

L 6 O 1. E O 
L 5 O 1. EO 
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Example 1 “PRINT - OUT 

State Equations for Each Mode 

M & ie i H ogle 3 


STATE EQU AV !ON PRINT OUT ^ —^S TATE'EOUAT ION PRINT^^OUr 

){(4ci(c:jc4c#iSti{c)9(!4t:^:gc#:^^i{()ici(c ■ . «: # •S' * «>Ssi<:Xe# 



Q^FCTDR^I^SXATE VARIATES 

GTVEtTUR OF STHTE V ART ABLE S 


inouctuve branch current 5 0 

INOUCTUVE BRANCH 

CURRENT 5 

0 

INOUCTUVE BRANCH CURRENT 6 0 

INDUGTUVE BRANCH 

Current 6 

0 


X-'VEeTGR-OF“INPtlT- VATTIABITES X VEGTOR OF INPUT VARIABLES l» 


voltage source 0 1 

voltage SOURCE 01 

j 


1 Kilt u i^N r n Yn*i ATirm 

H 

MTATRTX - 

IDfeWn i Y i*lAl KTK 

A MATRIX 

A 

MATRIX 


I J A{I,J) (NON-ZERO) 


I J 

AdffJ) (NON-ZERO) 

1 1 -2.22000E 00 


1 1 

-2.22000E 00 

1 2 -1 #000006-^02 


1 2 

-9V99999E-03 

2 1 -9.99998E-03 


2 1 

-l.OOOOOE-02 

2 2 “leOZOOOE 00 

1 

2 2 

■^1 #000006 Off~ 


B MATRIX B MATRIX 


T“~ — d“— — BriirJT — TNON=^ ERO-) ^I ~J Bl I » J ) ( NON^ZERO ) 

1-^ 1 ^l iOOOOOE- 00 1 1 l aOOOOOE 00 

2 I l.OOOOOE 00 21. 1 ,000006 00 
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Hot>£ DE$0^IPTI0N 

Hope i ( 0 4 t<. os) 

Kija , Rlj? i Ris, I6\; Ri>/7, 

Ki.ir are chssJ 

J %3 , Si.it- 

Mope 2(0-^ 4-'^<-^) 

Ri.a, Kz,i/ ,Kui4-,Ri%is-, Ris-,tt i ^hn, Rir,/i 

^i./r / Rt,ib , Kij/s , Rhu are closej 

Rhs I Rhii are ofQ^ 

hoPg 3 (~ t = 1) 

' ' ' mi 

STof 0fej?4r/P/l/ 

fi/rO All the Bn^cn caRRB.h)Ti. h/aOE. 

VOLTA f£S>. . ' 



A MATRIX 


<NON-ZERO> 


I J A(I»J) 


1 1. -1.0OZO3E 31 

1 2 -I. 0000 3 £ -3^ 

2 1 -1.33333E-32 

2 2 - 1.202036 01 

2 3 -i*33773S“09 

2 A -1.3 8773 6^39 

2 5 ^.65&61E-13 

2 6 9. 31322E~10 

2 7 -1.19&93E-39 

_3 3 -9 , 0 909 2 E_ OJ 

4 A -8o 3 3 3 3'^" I 3 7 

5 5 -2.2 006 2B 3 1_ 

5 6 - 2 . 5 OO 62 E -33 

5 7 -l o25Q01E-33 

6 5 -2«5&0blE-33 

6 6 -» .^03 53E 31 

6 7 -2.50002E-33 

7 5 = 1 ; 55251 E-D3 

7 6 ■■3.1 2502E-33 

7 7 “5«0OO73E 01 

8 8 -2,7231SE 31 

9 9 -275 012 5 £31 

10 10 -i.ooiooe 32 



I J B(I»J) I N Q N -2 E R □ ) 


1 

1 

1.30333E 33 

2 

1 

9, 99993E-31 

3 

1 

9.D9391E-31 

^ ^ 

1 

8o33333E-3l 

• 5 

1 

1.333D3E 33 

6 

1 

1» 30003 E 33 

7 

1 . 

lo25303E 03 

8 


“9.‘09391E-D1 

9 

1 

1.25333E 03 

10 

1 

1.0OOO3E 01 



-1.00003E 33 
-9.99999E-33 
'^9r9999 9E-33 
-1.333DDE 38 
•^7391 8 2 E’ or 
*Bo3333^E 37 
-2.2(ir5FE'7)l 
-6, 5 5 56 5 E- 3 3 
"-3.3333VE-33 
-5.65557E-D3 
•^A'.730^7E~31 
-3.33335E-D3 
-^.i5557E-33 
-4.15557E-33 
■^.OOOBBE^Sl 
-2. 7231 BE 31 


97 :-2,50125E 31 

10 10 -rOOlODE 32 


_B_J1ATRIX„ 


I 

J 

(MOM-ZERO 

1 

1 

l.OOOOOE 00 

2 

1 

1. 30030 e 33 

3 

1 

9, 39091E-31 

4 

1 

8,33333E-01 

5 

1 

9.99999E-31 

6 

1 

l.OOOOOE 03 

7 

1 

1.25303E 33 

8 

1 

9. 39091E-31 

9 

1 

1.25303E 33 

10 

1 

1.00303E 31 
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AI.I, !PHE BRMgH CURRENTS AND NODE VOLTAGES PF_ EXAMPLE*«2 
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III. 2. a Combinatorial Method 
2 . a . 1 Pr e 1 im inarv 

Consider the Electric Power System described by 
equation (5) of section III.lo Assiame for simplicity a 
single- input system, and J- CS,') = 0. Then 

'j - 

where „ . « t va., “i 

Bv. » 

and ^ is the measurable output vector ^ ^ t ^ rl . 

Assume that faults occur due to openings of some relays 
connected to loads. The problem is to detect faults of the 
system and to find their locations from the output measure- 
ments . 

One can detect faults simply by comparing the actual 
outputs with all the combinations of possible f au It- output s . 
However, thxs method requires ^ comparisons for a single 
measurement output. In the ease that one could measure r- out- 
puts, the number of comparison can be substantially reduced to 

r 


where 


21 
1=1 

Tl;^ is the number of loads in the 


6 . 

group associated 



Pig. 2.1 






2. a -2 Procrram Description For Conib inter ial Method 


The program consists of main two steps, 

(1) Partition the original system into NG Groups, 

For each Group find the ideal steady state load 
currents 

(2) For each Group assoc iatcid with the corresponding 
output, find all the possible faulted outputs in 
each Group if some faults are indicated in that 
Group and compare them with the corresponding 
actual output. (Refer to flow chart.) 


As shown in the accompanying flow chart (see also the 
program listings) , the FAULT Program consists of two major 
subroutines FAULII, COMET 


(i) Subroutine FAULII (N,. A, NG, Yact, LL, AL) 
Inputs ; 


Ns 
NGs 
LL(K) s 

AL(K) s 


Yact (I) s 


the order of system (the number of loads) 
the number of partitions 

the number of loads in the K-th partitioned 
Group (K=l, 2, , . . ,NG) 

the k-th ideal load current. - 



the I-th output load current flowing in the 
I-th partition. 


A(j)=AL(Ii+iI) , the J-th load in the II- th partitioned 
Group. 

YIDEL(I) s Ideal load current flowing in the I-th 
partitioned Group, ■ . 

(ii) COMET (N, A, Yact, INDEX) 

Input s 


Ns the order of system 
A (FAULII), Yact (FAULII) 
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Output s 

Index indicates the index-th faulted partition. 
This program can detect faults of order less than five. 
Example; 



Assiame that the above system was operating in the steady 
states. Ass-ume that suddenly the actual output measurements 
Yact(l), Yact(2) are different from the ideal ones and 
that they are 

(i) Yact(l) =5 o - . 

Yact(2) =0 

(ii) yact(l) =5 
Yact(2) = 3 

find fault locatigns for each case. 


Input Data 
Bloch 1 

(i) Read N 5 (in the format 15) 

(ii) ReadAL(I), 1=1, ..., 5 (4E15.5) 

(5., 3.33, 2.5, 2., 1.) 

Block 2 

(i) Read NG (15) , NG=2 

(ii) ReadLL(I), 1=1,2, (2,3) 



180 ) 


Printouts for each case 

(i) Yact = 5<, Yact{2) = 0^ (ii) Yact(l) = 5 
Yaet (2) =3 

are as follows; 

m ■ 

I' A DIAGNOSIS OF THE FA'JLT LOGATIONS 

♦ ♦#«>}: ij. * ***♦«« 4: « 


A SINGLE FAULT WAY DCCUP AT THE 2-TH LO AD IN ThE ■ l-TH GR OUP 


THE TRIPPLE FAULTS MAY OCCUR AT U 2, 3IN THE 2-TH GROUP 


THE END OF DIAGNOSIS 

ifr ♦ ♦ ♦ # ^ ^ * * ^ * # <« 



m - - 

■ 


A, diagnosis of the fauo locations 

:{e ^ « ij :ct >f: * 4: « 4: « * 




A SINGLE FAULT MAY OCCUP AT THE 2’- TH L O AD IN__TH E ijrXHJJ 

A SINGLE FAULT' MAY □CCU'^ AT THE TMET 03 AD IN THE 2-TH GROUP 


THE END OF DIAGNOSIS 

«««« #« t- «« * 


® page is 

POOfi QUAIJtb 



DIMENS ION A( 20) 9 YACK 10 ) »LL UG) » AL ( 2C ) 
READ! S 9 ID N ■ ~ “ 

READ159 15)(AL( 1)9 I = l»N) / 

READ'( 59~11) 

R6AD(59 D)(LL( 1)9 I=l9NG) 

■^■YACn D = 5 «' ^ 

YACT(2)=0. 

CALL PAUL II('»,A9NG9YAGT9LL9AL ) 

FORMAT! 15) 

F CRM At ( A E r5r*5i 


SUBROUT INE FAirin ir!^ 9 NG, YACT 9 LL 9 AU ^ 

DIMENS ION ALl ^0)9LL( 10)9YACT{ 10 ) 9 YI DEL ( 1 0) 9 A { 2 0) 
"TSIO" NUMB ER' ■QF~UO A DS 

ngo number of groups 

L'LTn 0 “NUM 8 ER'“0 F LO A DS~FDR“ THF^ I- TH^GR 0 UP 

M2=LL(l) 


Nl=NG-l 


YIDEL(K)=o.O 

— WR I T F( '6 T5 0 0 ) — — 

WRITE{ 69 55G) 

FO RH Al^( ’ *”^*~*^ *^^ * # # <:l{c * >}: / '/") 

FORMAT! « 1« 9 5 X 9 “ A DIAGNOSIS OF THE FAULT LOCATIONS' ) 

M 9 " 

YlDELfK )=TTDEL(K ) + AL( I ) 

TF(TK + 1 ) , 6 T -;.N G )— GO “ !□ 

M=M+LL(K) 

“ M 2= M2 +L LIK'D 0 ^ 

K-K + 1 

~1T0~TI]~2B 

CONTINUE 
DCr'3D‘ I=T,NG 

;F( ABS(YACT( T )-Y IDEL( I ) ) cLEoOoOl) GO TO 31 


INOEX=I 


IFI INiDEX oLE. V) GO TO A 6 

n = n + Lt n N D F x - n 

GO TO ^16 
D0“41 ^ J =17 
A(J)=AL(II+J) 

Y'ACT V= A'BTrY FCTmDTD EnX)~T 
CALL COMBT(N« A, YACTV » INDEX) 

CONTINUE 

CONTINUE 

^ j ^ ( 5 ^50 y j ^ 

WRITE! 69 551 ) 



FORMAT! ■" 


-9 « «x<'«%'7777 ) 


FORMAT! /////, lOX, ' 
RETURN 


DIAGNOSI Sn 


SU BROUT I N ECn M BT I N 9 A 9 Y AC TL 9 1 NDE X ) 
~CTMENS ION Ar2G)9 B! 20) 

DO IQI^IjN 








55 

-56 

57 


62 

63 
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300 


IFt ABS( YAGTL-^t U) »GEoCo0n GD TQ II 

WRITE (69 300) I » index 

FORMAT (// 5X» » A S-INGLE FAULT MAY OGGU'R A 7 THE » , I 5 » « -TH s , 
12X»®LOAO IN THE S I5i> «~TH GROUP « ) ' 


5 8 11 GGNTINJE 

59 10 CONTINUE 

60 IFINoLTo2 ) GO' TO 20 0 

61 Nl=N-l 


DO 20 I=UNl 
Il-I+l 


6^ 

65 


66 

57 


59 

flxi 


:L 


71 

12 

73 

74 


DO 20 J=Il,N 
B( J) = A( I )4A(J ) 

I F ( ABS { Y AGTl - B{ j ) ) o Gt ,0.01 ) GO TO n' 
WRITE{6,301) I»J, INDEX 


68 301 F0RMAT(//»5Y» « THE DOUBLE FAULTS MAY OCCUR A T » , I 5 j ® ® » 1 5 » 

1«TH LOAD IN TFESI5,'’“TH GRDUPM 


21 


20 


GONT INJ E 
CONT INUE 


IF(N,LTo3) GO TQ 3000 
NI=N-2 

C50 '30 I=l9Nl 
N2=N-l 


75 

76 


77 

7B 


79 
8 0 


81 

82 


11=1 + 1 

DO 30 J= I UN 2 
' Jl=J + l ■ 

DO 30 K-JI 9 N 
B ( K ) = A { D + At J ) +A ( K ) 

1F( ABStYACTL-PtK)) .GToO.Ol) GD TO 31 


302 


WRITE(6, 302) I9J9<» INDEX 

FORMAT(//95Yv "THE TRIPPLE FAULTS MAY OCCUR AT "9l5»' 


" 9 ! 


9 9 


nSj'IN THE"9T5»*-TK GROUP") 
83 31 CONTINUE 


84 30 CONTINUE 

35 3000 IFIN.LT.A) GO TO AQO 


86 

87 


88 
j 89 
I ' 90 
91 


Nl=N-3 

DO 40 I-l,Nl 


1 1= I + l 
N2=N-2 


DO 40 J = Il9N2 
J1=J + 1 


92 

93 


94 

95 


96 

97 


98 

99 


10 Q_ 
10 I 
102 


“ 103 
- 104 
! 105" 


N3=N-1 

DO 40 K=Jl9N3 


Kl=K+l 

DO 40 L=Kl9N 


B(K) = A( I) +A( J )+A(K )+A(L ) 

IFt ABStY ACTL-B{ L-) ),GEc 0,01) GO TO 41 


303 


41 

'40~ 

200 


WRITEI 69303 ) l9JfK9L 9 index 

FORMATt //»5V9 "THE QUADRAPLE FAULTS"9 
1 15 9 "7 159 " 9'' 9 159 " ; « VIS,""' IN"THE~" 9l 5 j^'^GRROUP " V 

CONTINUE 
'CONTI NJE 
GONT INJ E 


400 


CONTINUE 

RETURN 

END 


lA i 
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in 2, b Fault Detection Using Dynamic Observer Networks 


We assume that measurements are made at discrete sampling 
time instants and, hence, the dynamic model of the distribution system 
of section 111, 1 is converted to a set of difference equations: 


X { k + 1 ) = A X { k ) + B tf_ . (1) 

where the available measurements are 

X ( k) = C X (k) (2) 


The vector of inputs ^ are "pseudoinputs ” that do not exist in the 
actual distribution system, but are introduced in the fault detection system 
design to simulate possible fault conditions. For example, f_ = 0 might 
represent the no-fault condition and ^. = a vector of constant components^ 

would represent a faulted condition characterized, in the case of an open 
circuit, by having a state component x. = 0 , A dynamic observer will 
be designed to use the available distribution system measurements to 
obtain estimates of the state of the power system and thereby determine 
whether a fault condition exists. 


A discrete dynamic observer is a sampled-data system, described by 

x{k+l)=Fx(fc)+Kx /gx 


y (k) - C X (k) 


where 


F = 


A - K C ' 


( 5 ) 
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I ■ 

Note that the inputs to the observer are the measurements ( 2 ), 

The gain matrix K is selected so that the state of the observer x (k) is 

an estimate of the state x (k). Using (!) - ( 4) it can be shown that the 
observation error 

e (k) = ^ (k) - ^ (k) (6) 

and the difference between actual measurements and observer outputs 


y {k) = y (k) - y (k) 

satisfy 

e. ( k + 1 ) = *F e ( k ) + B f 

1 (k) 

t 

where 

K ( o ) 


= C £ { k ) 

= X ( o ) - X ( o ) 


( 7 ) 

( 8 ) 
{ 9) 

( 10) 


The automatie fault detection strategy is as follows: a sequence of 
measurements of the distribution system x( 1 )» 2 L{ 2 ), ..,,y(n) 
are made. Then, with the observer initially set in the zero s;;-j:te, 

X ( 0 ) = 0 , the sequence y ( 1 ) , y ( 2 ) , . , . , y ( n ) is obtained 


from ( 3 ) and ( 4 ) , Then, the following matrix equation is obtained : 

cr j CB 

CF* I C(f6 + B) 


for the unknown e 



f 


II 



e (0) 

II 


1 I 


7 ’ 


( 11 ) 
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Assume that when the measurement process, starts the power system 
is in either a faulted or a no-fault steady- state condition, x (O) = • 

Then, from (1) 


e(0) = (I-A)'i Bf 


( 12 ) 


One can rewrite (12) as 




’ M„ ; 

r<\ 






Thus, 

e, = M„f} + Mi 2 f^ 


Also one can rewrite (11) as 



(13 ) 


(14) 


(15) 


Eliminating the in (15) using (13) , (14) gives 


‘ ^ n 



■f/' 

! t 









( 16 ) 
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Where = 0 (fault) or ®Cn ( no-fatilt), and 0(fi is a given constant. 

The problem of fault detection is reduced to the problem of 

a 

solving the linear algebraic equations in (16) for fj and e^ , 

One must be careful in designing the observer system to guarantee that 
(16) can be solved uniquely. As yet, general conditions that guarantee 
uniqueness have not been obtained. The example below illustrates 
application of the approach to a simple problem. 
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V 


Example 


jonsider a system, illustrated by 



oZ-nG Lo(Kci 


One can formulate a corresponding discrete system, with zero-order 
hold as follows: 

t‘ Ym °h 


t. 


xi[(^+0tJ 


p -0.69T 

e 


0 





ft MAT) + 


Ay 


tj' 


^(^t) = X i (RT) + “£ 6(iT) 


where 


f' = 

" ir-l 




aj 


the measuring output. 
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One can also obtain the corresponding observer with gain vector 
K so that (F, C ) is observable, whe re 


F = A - K C 


Choose K 





Then the corresponding observer can be built as 


X (k + 1) T = F X ( kT) + K y* 


Where y is an actual output measurement. 

Using the program listed below we have simulated the automatic 
fault location procedure for the following output measurement: 

( i ) y*^= 7.2464 
( ii) y^= 4. S4S5 

(iii) y* = 11. 792 



Input Data to the program; 


N = 2 

C(l) . = 1, C (2) = I 

K(l) = 18/25, K (2) = 14/75 

A(l,l) .= Exp. ( -0.69), A (1,2) = A(2,l) = 0.0 

A(2,2) = Exp. ( -1.1) 

B(l, l) = (1-Exp. (-0.69) )/0. 69 
B(2,Z) = (1-Exp. (-1.1)) / 1. 1 
B(l,2) = B(2,l) = 0. 0 



THE OUPUT r'EASUREMENT Ys 


0-72<i6AE Gl 



THE M X 2N ^^flTRIX 


^ 4 ){i « :{c ^ ;>E ){< « lit $ 4 « » 4 « It: lie 4e 

-0o40509E 00 ■ -0.57380E no C.7223’5E 00 


Gr60648E' CO ‘ 0 , 19559E 00 


77E 00 0.42974E GO G«25848E GC 


‘F? 1 ) » F ( 2 ) 7; cToTF f K- 1 ) , E( M ) ~'2 ' " ARE ~ 

4 4 4 « 4 « 44 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 « 4 4 # 4 4 4 4 4 4 4 * 4 4 # 4 


0 o 5 0 0 0 0 E 0 r 0 o 48601 £-0 6' 


i/Vi 


{\\) THE C U PUT MEASUREMENT Y= Qo454£5E 01 


THE K X 2N PAIR IX 





t 4 4 « >!< !?! * 4 4 4 4 4 4 « # * « ♦ 4 4 4 44« 4 444444 44 444444 

-0 o 405 0 9 E GO o"57 3 8 0 E' n Q ^077223 5E 0 0" 


r6"G6^8E CO Oc 1955 9E 0 0 


“OT2'OT77^^O10 0T4"2'9TifEnj0“ 


10r2W48E ■ 00" 


T « I ) » F ( 2 ), 9 * o o77 F ( N- 1 )7 E ( N Vr'M = '“2 ARE 

44444 4 4 44 444444444444444444444 44 444.4 4444 





THE N X 2N MATRIX 



T a ITF( 2 rriTTVpT s 2 — SRc~~ 

<< :{S it**** J} «##!{=«.! I 5i<:Si 3j! Jj! S}: 1{|; * 





ORIGINAL 
OF POOR QUALITY 










fwic/i^AMO F^ULT T)ETgCrtC>K> ^RO^WaM 




$JQB 


1 

2 

3 


DIMENSION 

N=2' 


A( in, IG l*B( 10,10) ,Ct 1C) jAKUO) »AP (3) »FF<2) 
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AF( 3) = 5o/lo U5./0.6q 


1 

n~“ 

I 5 

1 

aFri)H5T7Ta 

AF( 2)=5o/0.69 




6 


FFri) = n.0' 



T 

7 


FF(2) = 5* 



ii 

8 


DC" 100 IJK=r,3 




9 


WRITEt6,500) 6F( UK ) 




i 

F0ftMATT'^l«,/777'® TFEIoUPUT MEASUREMENT V = «,E15,5,//) 



1 

ill 


NN=1 



li 

; 12 

300 

XCNTINUE 




13 


CU ) = lo0 



T1 

1 


C(2)=U0 



LI 

i 15 


AKH)=l0,/75* 




16 


AKT21^4T77 5~, 



-fi 

IT 


l) = 5, • . 



1 

T8 “ 


All, 1 ) = exp(-o;69i 




19 


• AC1,2) = G. 




,20 


AC2,I)=0o ■ 



II 

21 


A(2,2) = EXP{“1«1) 



T_j 

rZ2 


b 1 1 , 1 J = 1 1 o- EXP 1 “0 o69 ) ) / Oo 69 




23 


Bll,2) = 0. 




r2'4 " 


B12, l) = 0o 




125 


B(2,2) = ( l»-EXPl“lol) )/l,l 




26' 


F0=FF(NN) 



; ' 



GALL FAL)LT(AF( UK ) , A, B ^G , AK , U, N,F 0, 1 NDEX , T ND ) 



“t 

1 

rz8 


1 Fn NUTX o FQ oO . A"N DTTWD dO » I 200 




29 


NN=NN+l 




30 


IFlNN.GToT) GO TO 2C0 



) 

31 


GO TO 300 



“L., 

hz 

200 

CONTINUE “ ■ - 




33 

100 

CONTINUE 



1 !3^ 


STOP 



L 

35 


END 




f 

) 

36 


S U BROU T I N E F AU L T ( Y 0 , A , B , G , AK , U , N , F 0 , 1 NDE X , I ND ) 

1 

37 


DIMENS ION AKT IG ) V Y ( 1 0 ) FF ( 10 ) , XI 10,10) ,A ( TO, 10) 

L 

38 


DIMENSION 6< 10, 10), Cl 101, 0( 10 , 20 ) , St 2 1,D 1 2 ,2 ) ,BB(2) 

H 

39 


U 1 P'hNb 1 UN L'U( 2, 2), Flic, iU) 

!l 

40 


LMN=1 

i! 

"41 


NUMBR= 0 


42 

171 

continue 


43 


IND^O 

1 

44 


INDEX-D 


45 


N3=N-]^ ■ 

r 

i 46 


NUMBR=i+NUMBR 


47^ 

. 

DO 5 1=1, N - 

L 

•48 


CO 6 J=1,N 


47 

6 

XSl,J) = 0o 

r 

jso 

5 

CONTINUE 

i!5l 


UD 10 1= 1, N . 1 


52 


DO 11 J=1,N 

J1 

s53 


■'OlI,J) = 0o ' “ ■ ■ 

1154 

11 

CONT INU E 


55 

10 

CONT INU E 


,S6 


DO 20 1=1, N 


fyT 


~~"D0 21 J = l,N 


“ 58 


GC I, Jl = AKf I )«C( J ) 



§59 

160 

21 

20 

CONT INUE 
CONTINUE 

61 


DD 30 I=1»N 
CO 31 J=l»N 

|63 


FI I» J I=At 

*64 

31 

CONTINUE 

-65 

30 " 

CONT INUE 



N2=N=!'2 


6B 


. 193 ) 


■"C'ALX~MAT FMrN7C7Frff70‘ 
DO 40 I=lvN 

“DX ]- 4 

A( l9J)=«A( I,J ) 

CONTINUE 

DO 50 

— riTT ) = Arirm iT(T" 
CO 60 J~1»N 


BB( t ) = B( I« J ) 

CQ"“6'5“1J K= VtH • 

DO 62 IJJ=1,N 

— DiTjKrn^ )=s^rnK 7 TO r" — — 
CONTINUE 

rAnr"GA'a s's rMTCTBiTSTTim 

DO 63 K=l9M 

Dci 

eONTINME 

CO"^0~I=T'*'N ■ 

DO 71 J=UA». 

— c(i» j )=~rcrrTtn ^ 

CONTINUE 

COrTTlNOE 

Nl=N-l 

D 0 “ 8 IT“I^rPT — - 

DO Bl J=UN1 
A1 I , J")=0“TD 
DO B2 K^l»Nl 

A(xrJ)='Aixi“J7wrr,K7T?q3rir,'37“ 

CONTINUE 

GDWTTNUE 

DO 90 I=lfNl 

Dcrgn^TTN'i ""r "• 

CD( I, J )=A( I» J )+0( I, J+N) 

^CONT TNU E — ~ 

DO 10 0 I= l9N 

DDt l9 N TTTNl 

DO 110 I-l,Nl 

S^t i 1=0^ 

DO 111 J = l»fM 

^ 1 ) = S I T T +0 ( ITT) ^TirJTMl 

CONTINUE 

““CO" 120 1 = 1 »'N1 

s( n=st n+o( i, 2 #N) 

CA L“L~0 8 S C Y t K iY rNTY OTCT^ — 

DO 130 l=l»’Nl 

BBii )=Y n T-s n r*Fo~~"““ 

DO 140 I=1»N1 

DO ( N^ n = 0 .0 ““ — — 

DO lAl J=1»Ni 

oDiNnn5TciN7nwmrJi'*!‘OTT 

D D ( N J n = DD ( N» T ) +0 ( N » N + I ) 




rwo-r 


“133“ 


133 

6000 

ri39 

1 Uo 

2600 

lAT 


. 1A2 


1 ['A-3 

- 5500 - 



IT5 

2100" 

i IA6 

5501 

-| Uy— 

550'2 

LAS 

2000 

:1A9 

3000 

po 


~15 1 

v-i 


C'O'NTINU'E ““■ 

CCCtM,N )=□* NsN ) IS 

R='0 ^'O ^ ■ 

DO 160 1=1, W1 

R=R+0 1 M7n ♦ D (“17NT 

R=R+0(N, Z'frN 1 

BB I N )=Y( N )-R*FO — 

CALL GAUSSCn»CD,-BB,S, ILL ) 

C0“'l70 I=lVlM3~” 

IF( ABS(S( n-iJ ).LEaO»OOl,DR.ABS(Sm I.LE.CcOOl) GO TO 170 

rNDlJfS'l ^ ^ : 

CONTINUE 

“TRABS ( s IN )-Fo ■=?DTNT^n“^;LH7cTdCl7" rNo=r 

LMN=LMN+l 

“I F{ INDEX oEQo*^ .AND. INC.EO . 1» "60“ TO eOGO"" 

IFINUMBR.GT, 1) GO TO 5C00 

glQr-^ 17 £ — — 

WRITEI6, 21001 

■ W R I T E ( 6, 5 502 1 : 

WRITE(6, 20^0 M (0( I , J I, J = 1, N 2 ) , I = 1 , N ) 

WRIT Ef6; 2 600) 

FORMAT!//////, » 0 ) 

¥RTrR 6755'0TiT'n — ' 

WRITE! 6, 5_50n ) _ _ _ _ 

FORMAT ! * « V » «#«!!<■#« j7/ ) 
WRITE!6,2000)!S! M,I = l,N) 

Form ATI / //V®' 'T H E n x 2 n matrix®,/) 

FORMAT! ' ® , ® F! U, F! 2), . . . . , F ! N- 1 ) , E ! N ) , N= « ,I 3 ,2 X , » ARE » ) 
FORMAT! ° ® , • 

FORMAT!® «, BE 15.5) 

~C0NTINUE“~ “ ' 

RETURN 

"END ““ ^ ^ 


on, I+N) = Q( 1 , I4N )+C{ J )«B( J, I) 

CONTINUE 
CO' 40 UK 
DO 42 J=l,N 

“10T~Ulf7T+NT^7C)‘ — — — 

CO 43 K=l,N 

on j K , j +N ) =0 <1 j K , j4W ) +o'( rj RnTTKy*^ n< , j ^ 

CONTINUE 

CONT I NUE ■ 

CONT INUE 

GTfNT TNU E ~ 

DO 611=1, N 

D0“60'J=I7N^ " ™ 

Q( I, J)=Q( T,N+J » 

— CONT INU E ^ — ■ ■ ■ ~ ^ 

DO 70 1 = 2,N 


71 

01 I, J+N ) = 0I I, J4N l+QI I^ 1, J ) 


“'70 

CONTINUE 

RETURN 

' 


SUBROUTINE OOSDY(X,Y,N,YO,C) 

DiMBNS ION XI 10, 10 ), Y( 10),2( 10) ,Cl IG) 

z ri ) = 0 , 0 “ — 

DO 10 K=1,N 

Xr 2,'K+T):={ EXP{ -1% 1“)- r47775n’«‘Xr27K l4o'/7ro4Xir,K) + l4o /75 >Y0 
XI l,K+l) = (EVP{-0o69)-18o/25o )*X( 1,K)-18. /25o (2 , K) + 1 8, /25.o #Y0 

Z( K + D =Q ro ^ — 

DO 15 J=1,N ^ 

CONTINUE 

CONTINUE ^ — 

DO 20 K = 1,’N 

YTK ) =Y 

CONTINUE 

JVETU RN 


subroutine G.''USS(N,A,B,X, ILL ) 

IMEN S 1 0N“ ATN VN')1 BTM ITXTNT 

ILL = 0 

TFT N- 1 ) 4Ti; 4 ^ 

IF(A( 1, 1) )2,.3, 2 

XT 1 ) = 8 TT ) f AT ITU 

RETURN 

— nursT 

RETURN 

NLTSS1=N-1 — 

CO 13 1= UNLESS 1 

E IG=ABS( A{ l,n )-) 

L= 1 

“TP LU S T=“I +T 

DO 6 J=IPLUSl,N 

— IF( ABS ( ATJ, I ) )-BlG)-“6T6fS^' 
eiG=ABS( A(J, I ) ) 





36 

‘^38 

J39 

8 

RETURN 

IF(L-n5jlU9 

196) 

9 

CO IG J=1,M 
TEMP=A(LjJ ) 




AILfJl = A(!fJ) 



10 

At I» J ) = TEMP 


242 


TEMP*BIL> 


-f??- 


B(L)=B(I) 


^44 


Brn=TEMp 


2^»5 

11 

CC 13 J= IPLUS UN • 




QuoT=A*n, rmrirn 




DO 12K = IPLUS 1,N 


*^48 

12 

A( l,lO=AlJ,K)-QU0T«A(TsK) 



13 

B( J) = 0( J1-QU0T=S=8( n 


#50 


IFIAINsN)} 15»14»15' " 


aS51 

14 

ILL=1 


252 


gy u 


S?53 

15 

X(N) = B(N1/A( N,N ) 

• 

l[!54 


i=N-r 


255 

16 

SUM=Oo 


j-756 


IPLUS1=I+1 


IP? 


CO 17 J= IPLUS IjN 


458 

17 

^0M= S¥‘^ + A n , JT# XTJl 


259 


X( I)=( Bl T V-SUM)/Al U I) 


“1l60~ 


r=i-y * ' 


Lisi 


IFl 1) 18^18, 16 


262 

■ 18 

RETURN 


uw 


END 


IJ 

$DATA 







